DIFFERENTIAL EQUATIONS 
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Many physical problems, when formulated in mathematical 
forms, lead to differential equations. Differential equations enter 
naturally as models for many phenomena in economics, commerce, 
engineering etc. Many of these phenomena are complex in nature 
and very difficult to understand. But when they are described by 
differential equations, it is easy to analyse them. For example, if the 
rate of change of cost for x outputs is directly proportional to the 
cost, then this phenomenon is described by the differential equation, 

^t^ = k C, where C is the cost and k is constant. The 
dx 
solution of this differential equation is 

C = C e^ where C = C when x = 0. 

6.1 FORMATION OF DIFFERENTIAL EQUATIONS 

A Differential Equation is one which involves one or more 
independent variables, a dependent variable and one or more of 
their differential coefficients. 

There are two types of differential equations: 

(i) Ordinary differential equations involving only one 
independent variable and derivatives of the dependent variable 
with respect to the independent variable. 

(ii) Partial differential equations which involve more than one 
independent variable and partial derivatives of the dependent 
variable with respect to the independent variables. 

The following are a few examples for differential equations: 
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(1), (2) and (3) are ordinary differential equations and 

(4), (5) and (6) are partial differential equations. 

In this chapter we shall study ordinary differential equations 
only. 

6.1.1 Order and Degree of a Differential Equation 

The order of the derivative of the highest order present in a 
differential equation is called the order of the differential equation. 

For example, consider the differential equation 
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+ 7^-4v = 
dx 



The orders of — — , — — and — are 3, 2 and 1 respectively. So 

dx 3 dx 2 dx 

the highest order is 3. Thus the order of the differential equation is 3. 

The degree of the derivative of the highest order present in a 
differential equation is called the degree of the differential equation. 
Here the differential coefficients should be free from the radicals 
and fractional exponents. 

Thus the degree of 
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Example 1 

Write down the order and degree of the following 
differential equations. 
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-logx= 
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(^ iiJ ) VT^ 2 = — 






rfjc 2 

Solution : 

The order and the degree respectively are, 

(i) 1 ; 3 (ii) 2 ; 3 (iii) 2 ; 1 (iv) 3 ; 1 

(v) 1 ; 2 (vi) 2 ; 3 (vii) 2 ; 2 (viii) 1 ; 1 

Note 

Before ascertaining the order and degree in (v), (vi) & (vii) 
we made the differential coefficients free from radicals and fractional 
exponents. 

6.1.2 Family of curves 

Sometimes a family of curves can be represented by a single 
equation. In such a case the equation contains an arbitrary constant 
c. By assigning different values fore, we get a family of curves. In 
this case c is called the parameter or arbitrary constant of the 
family. 

Examples 

(i) y = mx represents the equation of a family of straight lines 
through the origin , where m is the parameter. 

x 2 + y 2 = a 2 represents the equation of family of concentric 
circles having the origin as centre, where a is the parameter. 

y = mx + c represents the equation of a family of straight 
lines in a plane, where m and c are parameters. 
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(iii) 



6.1.3 Formation of Ordinary Differential Equation 

Consider the equation y = mx + ~k (1) 

where m is a constant and X is the parameter. 

This represents one parameter family of parallel straight lines 
having same slope m. 

Differentiating (1) with respect to x, we get, — = m 

dx 

This is the differential equation representing the above family 

of straight lines. 

Similarly for the equation y = Ae 5x , we form the differential 

equation — = 5y by eliminating the arbitrary constant A. 
dx 

The above functions represent one-parameter families. Each 

family has a differential equation. To obtain this differential equation 

differentiate the equation of the family with respect to x, treating 

the parameter as a constant. If the derived equation is free from 

parameter then the derived equation is the differential equation of 

the family. 

Note 

(i) The differential equation of a two parameter family is obtained 
by differentiating the equation of the family twice and by 
eliminating the parameters. 

(ii) In general, the order of the differential equation to be formed 
is equal to the number of arbitrary constants present in the 
equation of the family of curves. 

Example 2 

Form the differential equation of the family of curves 
y = A cos 5jc + B sin 5x where A and B are parameters. 

Solution : 

Given y = A cos 5x + B sin 5x 

-f- = -5A sin5x + 5B cos 5.x; 
dx 
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d 2 y 
dx 2 

— f + 25y = 0. 



-25 (A cos 5x) - 25 (B sin 5x) = -25y 



dx 

Example 3 

Form the differential equation of the family of curves 
y = ae 3x + be x where a and b are parameters. 

Solution : 

= ae 3x + be x 



y 

— = 3ae 3x + be x 
dx 



d 2 y 
dx 2 



= 9ae 3x + be 
dy 



(2)-(l)=> ^f- -y = 2ae 3x 
dx 



(1) 

(2) 

(3) 
(4) 



(3) - (2) 



d 2 y _ dy 
dx 2 dx 



-6ae 



3x 



dy _ 
dx 



[using (4)] 



^Z-4^ + 3y = 
dx 2 dx 



Example 4 

Find the differential equation of a family of curves given 
by y = a cos (mx + b), a and b being arbitrary constants. 

Solution : 

- a cos (mx + b) (1) 



y 

— = -ma sin (mx + b) 
dx 

d 2 y 

dx 2 

d 2 y 

— — + m 2 y = is the required differential equation. 

dx 



m 2 a cos (mx + b) = -m 2 y 



[using (1)] 



Example 5 

Find the differential equation by eliminating the arbitrary 
constants a andZ» from y = a tan x + b sec x. 

Solution : 

y = a tan x + b sec x 
Multiplying both sides by cos x we get, 

y cos x = a sin x + b 
Differentiating with respect to x we get 

/ \ dy 

y (-sin x) + -^- cos * = a cos x 
ax 

=> -ytanx+— =a (1) 

dx 

Differentiating (1) with respect to x, we get 



d 2 y _ dy 
dx 2 dx 



tan x - y sec 2 x = 



EXERCISE 6.1 



1) Find the order and degree of the following : 



(i)x : 



d 2 y ~dy 



dx 



dx 



/■■\ d y r, 
+ y = cosx (n) — —— 5 



( iii) ^--J^L = 
dx V dx 
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> 
V 


3 


d y 
dx 


U x2 y 


2 


= 


(dy\ 

dx 
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ax 
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, d 2 y dy 

1 + f =*-r- 

dx ax 



(viii) 3^-£ +5 
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dx 
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dx 



f d 2 y 
dx 



dy 



dx 

\ J 



— 3y = e* 
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r^, A 



dy 
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2) 



Find the differential equation of the following 
(i) y - mx (ii) y - ex - c + c 2 

6 



(iii) y - mx -\ , where m is arbitrary constant 

m 
(iv) y - mx + c where m and c are arbitrary constants. 

3) Form the differential equation of family of rectangular 
hyperbolas whose asymptotes are the coordinate axes. 

4) Find the differential equation of all circles x 2 + y 2 + 7gx - 
which pass through the origin and whose centres are on 
the x-axis. 

5) Form the differential equation of y 2 = 4a (x + a), where a is 
the parameter. 

6) Find the differential equation of the family of curves 
y - ae 2x + be 3x where a and b are parameters. 

7) Form the differential equation for y - a cos 3x + b sin 3x 
where a and b are parameters. 

8) Form the diffrential equation of y - ae hx where a and b 
are the arbitrary constants. 

9) Find the differential equation for the family of concentric circles 

x 2 + y 2 - a 2 , a is the paramter. 

6.2 FIRST ORDER DIFFERENTIAL EQUATIONS 

6.2.1 Solution of a differential equation 

A solution of a differential equation is an explicit or implicit 
relation between the variables which satisfies the given differential 
equation and does not contain any derivatives. 

If the solution of a differential equation contains as many 
arbitrary constants of integration as its order, then the solution is 
said to be the general solution of the differential equation. 

The solution obtained from the general solution by assigning 
particular values for the arbitrary constants, is said to be a particular 
solution of the differential equation. 
For example, 
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Differential equation 



General solutuion 



Particular solution 



(i) — = sec 2 * 
dx 



(ii) *L = x 2 + 2x 
dx 



y = tan x + c 
(c is arbitrary constant) 

x 3 9 

y = 4- +x 2 + c 



(iii) 



d 2 y 
dx 2 



-9y = 



y 



= Ae 3x + Be~ 3x 



y= tan x - 5 



y =^- + * 2 + 8 



j = 5e 3x -7e _3jr 



6.2.2 Variables Separable 

If it is possible to re-arrange the terms of the first order and 
first degree differential equation in two groups, each containing only 
one variable, the variables are said to be separable. 

When variables are separated, the differentail equation takes 
the form fix) dx + g(y) dy = in which f(x) is a function of x 
only and g(y) is a function of y only. 

Then the general solution is 

\ f (x) dx + \ g (y) dy = c (c is a constant of integration) 

For example, consider x — — y = 

dx 



dy 
x-f- =y 
dx 



dy a x 
=> — = — (separating the variables) 

c dy r (\ x 

=> — = \- k where k is a constant of integration. 

J y J x & 

=> log y = log x + k. 

The value of k varies from -°° to °°. 

This general solution can be expressed in a more convenient 
form by assuming the constant of integration to be log c. This is 
possible because log c also can take all values between -°° and °° 
as k does. By this assumption, the general solution takes the form 



log y - log x = log c 



log (2 ) = log c 

x 



i.e. — = c => y = ex 

x 

which is an elegant form of the solution of the differential equation. 

Note 

(i) When y is absent, the general form of first order linear 

differential equation reduces to — =f(x) and therefore the 

ax 



solution is y = \f (x) dx + c 



(ii) When x is absent , it reduces to -f- = g(y) 

dx 

and in this case, the solution is , s = \dx+ c 

J g(y) j 

Example 6 

Solve the differential equation xdy + ydx = 

Solution : 

xdy + ydx = , dividing by xy we get 

^1.4^ _ n TU _ [dy_ _ ( dx _ c 



+ ^ =0. ThenP+ U 

x J y J 



y x ■ J y J x ' 

log y + log x = log c => xy = c 

Note 

(i) xdy + ydx = => d(xy) = => xy = c, a constant. 
„jc, ydx -xdy f ydx -xdy r ,r. x 

Example 7 

Solve ^ = e 3x+ y 
dx 

Solution : 

dy 3x v ^ 5x^ 

dx e 



je y dy = JV X dx + c 



3x 3x 



Example 8 

Solve (jc 2 — ay)dx = (ax —y 2 )dy 

Solution : 

Writing the equation as 

x 2 dx + y 2 dy = a(xdy + ydx) 
=> x 2 dx + y 2 dy = a d(xy) 

\x 2 dx + \y 2 dy= a\d (xy) + c 

3 3 

=> y + y = a(xy) + c 

Hence the general solution is x 3 + y 3 = 3axy + c 

Example 9 

Solve y(l+x 2 )> dy +x*jl + y 2 dx = 

Solution : 

y-Jl+x 2 dy + x*Jl + y 2 dx =0 [dividing^ Vl+/ V 1 " 1 "/ ] 



dy + , = dx =0 



, = Jy + . i d* = c, 
-» ? 2 dt + ^ \u 2 du = c 



i.e. ? 2 + w 



2 
c or <Jl 



Put 1+y 2 = f 

2ydy = dt 

put l+;t 2 = u 

2xdx = du 



+ y 2 + V1+* 2 



Note : This problem can also be solved by using 

\[f(x)Vf'(x)dx= [/( -" )] " +1 
J n + l 

Example 10 

Solve (sin jc + cos jc) dy + (cos jc — sin jc) dx = 

10 



Solution : 

The given equation can be written as 

, , cos x - sin x , A 

ay + — ax = 

sin x + cos x 



^ ( dy+ r ooss-ms ^ 

J J sin jc + cosjc 

=> 
Example 11 



sin jc + cosx 
j + log(sin x + cos x) = c 



Solve jc-^- + cos v = 0, given v =^f when* = V2 
ax 4 

Solution : 

x dy = -cos y dx 

[sec y Jv = \-k, where k is a constant of integration. 

log (sec y + tan y) + log jc = log c, where A: = log c 
or ^(sec y + tan y) = c. 

When x = -J2 , y = -j ' we nave 

-Jl (sec j + tan J) = c or c = 4l (J2 + 1) = 2 +V2 

The particular solution is jc (sec y + tan y) = 2 + V2~ 

Example 12 

The marginal cost function for producing x units is 
MC = 23+16*: - 3jc 2 and the total cost for producing 1 unit is 
Rs.40. Find the total cost function and the average cost 
function. 

Solution : 

Let C(x) be the total cost function where x is the number of 
units of output. Then 

4R =MC = 23 + 16*-3;t 2 
dx 
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.-. \^dx = \(23+l6x-3x 2 )dx+ k 



C = 23x + Sx 2 - x 3 + k, where k is a constant 
At x= 1, C(x) = 40 (given) 
23(1) + 8(1) 2 - l 3 + k = 40 => k =10 
Total cost function C(x) = Tix + Sx 2 - x 3 + 10 

Average cost function = Total cost function 

x 
_. 23x + 8x 2 -.y 3 +10 

Average cost function =23 + 8jc-^: 2 + 



2 . 10 



^ = 



Example 13 

What is the general form of the demand equation which 
has a constant elasticity of -1 ? 

Solution : 

Let x be the quantity demanded at price p. Then the 
elasticity is given by 
~P dx 
x dp 

Given ^^=-1 ^ dx = ±^ rdx = (± +logk 
x dp x p J x J p 

=> log x = log p + log k, where k is a constant. 

=> log x = log kp => x = kp => p = -\-x 

k 

i.e. p = ex, where c = — is a constant 

k 

Example 14 

The relationship between the cost of operating a 
warehouse and the number of units of items stored in it is 

given by ^ = ax + b, where C is the monthly cost of operating 
the warehouse and x is the number of units of items in storage. 
Find C as a function of x if C = C^, when x = 0. 
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Solution : 

Given ^ = ax + b :. dC = (ax + b) ax 
ax 

\dC = Uax + b)dx + k, (k is a constant) 

=> C = ^-+bx +k, 

when;t = 0, C = C .-. (1) ^ C = |(0) + b(0) + k 

=> fc = C 
Hence the cost function is given by 

C = -|;t 2 + bx + C 

Example 15 

The slope of a curve at any point is the reciprocal of 
twice the ordinate of the point. The curve also passes through 
the point (4, 3). Find the equation of the curve. 

Solution : 

Slope of the curve at any point P(x, y) is the slope of the 
tangent at P(x, y) 

■•■ % =27 => 2 ydy = dx 

|2yjy =^dx+c => y 2 =x + c 

Since the curve passes through (4, 3), we have 

9 = 4 +c =>c = 5 
Equation of the curve is y 2 = x + 5 

EXERCISE 6.2 



1) Solved)-^— r =0 (n)^ = TTI T 

cm) £ = >^ ( lv) xViT7 + yVIT7 £ = 

ctJL Ji J. CL/C 
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2) Solve (i) — = e 2x ' y + x 3 e~ y (ii) (1-e*) sec 2 y dy + 3<?*tan y dx = 

<ix 

3) Solve (i) ^ =2xy + 2ax (ii) x(y 2 + 1) dx + y(x 2 + 1) dy = 

dx 

(iii) (x 2 - yx 2 ) -p- + y 2 + xy 2 = 
<ix 



4) Solve (i) xdy + ydx + 4^/l-x 2 y 2 flfr =0 (ii) ydx-xdy +3x 2 y 2 e x dx-0 

r, c i ^ ^ y 2 + 4y+5 dy , y 2 +y + l _ 

5) Solve (i) -^ = —., (n) -^+^ = 

dx x -2x + 2 Jx x +x+l 

6) Find the equation of the curve whose slope at the point (x, y) is 
3x 2 + 2, if the curve passes through the point (1, -1) 

7) The gradient of the curve at any point (x, y) is proportional to 
its abscissa. Find the equation of the curve if it passes through 
the points (0, 0) and (1,1) 

1 y l 

8) Solve : sin x dy + , dx - 0, given thaty = 2 when x = — 

VI- x 2 2 

9) What is the general form of the demand equation which has an 

elasticity of - n ? 

10) What is the general form of the demand equation which has an 

elasticity of — — ? 

11) The marginal cost function for producing x units is 
MC = e 3x + 1 . Find the total cost function and the average cost 
function, given that the cost is zero when there is no production. 

6.2.3 Homogeneous differential equations 

A differential equation in x and y is said to be homogeneous 
if it can be defined in the form 

~dx~ = g(x,y) where /(*' y) and S(x, y) are 
homogeneous functions of the same degree in x and y. 

dy _ xy dy_ _ x 2 +y 2 dy_ _ x 2 y 

~dx~~ x 2 + y 2, ~dx~~' 2xy '^~x 3 + y 3 
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and t = ^-^ +y 

dx x 

are some examples of first order homogeneous differential equations. 

6.2.4 Solving first order homogeneous differential equations 

If we put v = vx then — = v + x -p- and the differential 

dx dx 

equation reduces to variables sepaerable form. The solution is got 

by replacing — for v after the integration is over. 

x 

Example 16 

Solve the differential equation (jc 2 + y 2 )dx = 2xydy 

Solution : 

The given differential equation can be written as 

dy * 2 + y 2 m 

-£- 2xy {L) 

This is a homogeneous differential equation 

Put y = vx :.^= v +x^- (2) 

dx dx 

Substituting (2) in (1) we get, 

d v _ x 2 + v 2 x 2 _ i +v 2 



V + X 



dx 2x(vx) 2v 

x dy_ _ 1+v 2 v ^ x dv _ 1-v 2 
dx 2v dx 2v 

Now, separating the variables, 

^Tdv=^ or }-^ = |^ +Cl 
1-v x J 1-v J X, x 

-Pi \ 

log ( 1 - v 2 ) = - log x + log c [ \yj-jdx = log/(x)] 

or log (1 - v 2 ) + log* = log c ^> (l-v 2 )* = c 

Replacing v by — , we get 
x 
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( 2 A 
V X J 



x = c or x 2 - y 2 = ex 



Example 
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Solve : 


(x 3 + y 3 )dx = 


- (x 2 y + xy 2 ) dy 






Solution : 
















The 


given equation can be written 


as 








dy 
dx 


. 


x 3 +y 3 








( 




X 


2 y + xy 2 




Put 


y = 


dy 

- vx ;. -f- 

dx 


= v+x^- 
dx 








=> 


V + X 


dv 
dx 


1+v 3 
v + v 2 










=> 


x — 
dx 


= 


1 + v 3 

2 -v - 
v + v 


l-v 2 
- v(v + l) - 


(l-v)(l + v) 
v(v + l) 



(1) 



\-r^—dv = f— dx + c 
J l-v J X 

=> l-r-^- dv =-[— dx + c or f- — — — dv=-\— dx+c 
J 1- v J x J l-v J x 

v + log (l-v) = - log * + c 

Replacing v by — , we get — + log (x - y) = c 

x x 

Example 18 

Solve x^- = y-Jx 2 +y 2 
dx 

Solution : 

now, f- = y-^ x2+ y 2 (i) 

dx x 

p U t y = vx .". ^L=V+X^ 

dx dx 
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(1) 



V + X 



dv 



dv _ vx-yx 2 +v 2 x 2 =v _ f\ 



dx 



+ v 



f dv r , 



dv 



J- 



-J 



+ v 



dx 



+ c, 



or 



i.e. 



log (v + Vl + v 2 )= - log x + log c 
log x (v + Vl + v 2 )= log c 
x (v +Vl + v 2 ) =c 

>' 



+, 1+ 



x 2 



V j 



= c or y + Jx 2 + y 2 = c 



Example 19 

Solve (x + v) rfy + (x — y)dx = 

Solution : 



The equation is 



<iy _ 
dx 



Put v = vx .-. -+- = v + x , 
dx dx 



x-y 

x + y 
dv 



(1) 



we get 



i.e. 



x 



v + x 



dv 



dv 
dx 



^^ or v+x^ 
dx 



dx 

1+v , 1 , 
: Tdv = dx or 

1+V X 



x + vx 

T? +V l or xf 
1 + v J dx 

1 



1-v 

1 + v 

(1-v + v + v 2 ) 
1+v 



f rfv , 1 f 2v , r 1 , 

: T dv + 7T 7 av = afx + c 

J l+v 2J 1 + v J x 

tan _1 v + -=■ log (1 + v 2 ) = -log x + c 



i.e. tan - 



X 

\ J 



+ ilog 



' x + y 



-logx + c 
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tan 



i.e. tan" 



x 
V J 

X 

\ J 



+ -= log (x 2 + y 2 ) - -= log* 2 = -log* + c 
+ ^ log (x 2 + y 2 )=c 



Example 20 

The net profit p and quantity x satisfy the differential 

t . dp 2p 3 -x 3 
equation -f- = — - — ; — • 
ax 3xp 

Find the relationship between the net profit and demand 

given that/? = 20 when x = 10. 

Solution : 



dp 2p 3 - x 3 



-(1) 



dx 3xp 
is a differential equation in x and p of homogeneous type 

r. . dp dv 

Put p = vx .-. — = v + x — 

CUC CIJC 



(1) 



V + X 



dv 
dx 
dv 
dx 



2v -1 

3v 2 



l + v J 






3v' 



3v 2 



"Jv = 



It 



3v : 



"dv 



2v -1 

3v 2 



abc 



l + v jav_ * •'• J l + v J J X 

log (1 + v 3 ) = -log x + log k , where fc is a constant 



log (1 + v 3 ) = log 



k ■ , , ■* k 
i.e. 1 + v-* = — 



Replacing v by — , we get 

=> x 3 + p 3 = kx 2 
But when x = 10, it is given thatp = 20 

(10) 3 + (20) 3 =k(l0) 2 ^k = 90 
p 3 = x 2 (90 - x) is the required relationship. 
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x 3 +p 3 = 90* 2 



Example 21 

The rate of increase in the cost C of ordering and 
holding as the size q of the order increases is given by the 
differential equation 

-j— = j • Find the relationship between C and 

q if C = 1 when q = 1. 

Solution : 

dC C 2 + 2Cq 

dq ~ q 2 U> 

This is a homogeneous equation in C and q 

_, dC dv 

Put c = V( i ••• ~dq~= v + ^ 

... dv v 2 q 2 + 2vq 2 

W =* v + *^ = ^ = v 2 + 2v 

dv - . dv dq 

=> *Tq =v 2 + v = v(v+l) => ^—^= — 

r(v + l) — v rdq 

=> — -, — -7T- dv = — + k , k is a constant 

J v(v + l) J q 

=> f^- f^V= f^+logife, 

J v J v + 1 J <? 

=> log v - log (v + 1) = log q + log k 

=> log — ^— - = log qk or — v — = kq 
b v+1 b ^ v+1 ^ 

C 

Replacing v by ~~~ we get, C = kq(C + q) 

q 

when C = 1 and q = 1 

C = *$(C + ?) => * = ^ 

C = — — is the relation between C and q 

2 
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Example 22 

The total cost of production y and the level of output x 
are related to the marginal cost of production by the equation 
(6x 2 + 2v 2 ) dx - (x 2 + 4xy) dy = 0. What is the relation 
between total cost and output if y = 2 when x = 1? 

Solution : 

Given (6x 2 + 2y 2 ) dx = (x 2 + Axy) dy 

jy_ 6x 2 + 2y 2 

dx x 2 + 4xy 
is a homogeneous equation in x and y. 

Put y = vx.: ^=v+x^ 
dx dx 

,... dv 6x 2 + 2y 2 l+4v , 1 , 

(1) => v+x iilL=— — or — -jdv=— dx 

dx x + 4xy 6-v-zv x 

f — 1 — 4v M 

- — r— 7 Jv = — d* + A:, where k is a constant 

J 6-v-2v J x 

=> -log(6-v-2v 2 ) = log x + log fc = log kx 

1 
6 - v - 2v 



— c\u — z, — o; =^> c : — — - 
4* = (2y 2 + xy - 6x 2 ) 



=> x = c(6x 2 - xy -2y ) where c = — and v = 
when * = 1 and y = 2 , 1 = c(6 - 2 - 8) => c = 



EXERCISE 6.3 

1) Solve the following differential equations 

ax x x dx x x 

r--\ dy xy -2y dy 2 

(in) -A. = (iv) x (y - x) -f- =y l 

dx x - 5xy dx 

2 



</y _ y -2xy dy_ _ W 

~dx~~ x 2 -2xy {Y1) ~dx~~x 2 -y' 
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(vii) (x + y) 2 dx = 2x 2 dy (viii) x— - y + Jx 2 + y 2 

dx 

2) The rate of increase in the cost C of ordering and holding as 

the size q of the order increases is given by the differential 

dC C 2 + q 2 
eqation ~JZ- — o7^ • Find the relatinship between C and 

q if C = 4 when q - 2. 

3) The total cost of production y and the level of output x 
are related to the marginal cost of production by the 

dy 24x 2 -y 2 „„ . , , . 

equation — J — - . What is the total cost function 

dx xy 

if y = 4 when x = 2 ? 

6.2.5 First order linear differential equation 

A first order differential equation is said to be linear when 
the dependent variable and its derivatives occur only in first degree 
and no product of these occur. 

An equation of the form — — + Py = Q, 

dx 

where P and Q are functions of x only, is called a first order 
linear differential equation. 

For example, 

(i) ^L+3y = x 3 ; here P = 3, Q =x 3 
dx 

(ii) -^- + y tan x = cos x, P = tan x, Q = cos x 

dx 

(m)^-x-3y = xe x , P = --, Q = e x 

dx x 

(iv) (1 +x 2 )^ +xy = (l+x 2 f, P =-^r, Q = (1 +x 2 f 
ax \-+x 

are first order linear differential equations. 

6.2.6 Integrating factor (I.F) 

A given differential equation may not be integrable as such. 
But it may become integrable when it is multiplied by a function. 
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Mx 



Such a function is called the integrating factor (I.F). Hence an 
integrating factor is one which changes a differential equation into 
one which is directly integrable. 

[vdx 

Let us show that e 1 is the integrating factor 

for ^+Py =Q (1) 

dx 

where P and Q are function of x. 

f] /■ \ Pdx \ d\ \Pdx A \Pclx 

Now, ^-(ye 1 )=^l e i + v^-(e J ) 
ax ax ax 

dy \vdx \vdx A p 

dx ax J 

dy Wdx fp<fa ,(/y iD , f 

= — e J + ye J P = ( — +Pv)e J 
dx ' dx 

[Prfx 

When (1) is multiplied by e 1 , 

dy \Pdx [Pdx 

it becomes ( — +P y ) e 1 = Qe 1 
dx 

=> JL (ye 1 ) = Q e J 
ax 

Integrating this, we have 

\Pdx f^ \ pdx 

ye ] =JQ^ J dx + c (2) 

pdx 

So e J is the integrating factor of the differential equation. 
Note 

(i) e l °s /») = fix) when f(x) > 

(ii) If Q = in — + Pv = Q, then the general solution is 
dx 
y (I.F) = c, where c is a constant. 

(iii) For the differential equation ~j~ + Px = Q where P and Q 
are functions of y alone, the (I.F) is e' and the solution is 

x(I.F)= \Q(l.¥)dy + c 
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Example 23 



,2^J 



Solve the equation (1 - jc 2 ) -^- - xy = 1 

dx 



Solution : 



,*\ d y 



The given equation is (l—x 2 ) — — xy = 1 

ax 

Ox l-;t 2 l — x 

This is of the form — + Py = Q, 
ax 

-x n 1 



where P = j ; Q = 



l-* 2 ,v ~ l-x 2 

I.F = e ] = e 1 -* = Vl-x 2 
The general solution is, 

y(I.F) = JQ(I.F)ox +c 

y^l-x 2 = \ — 2 yl-x 2 dx + c 
f dx 

y^l-x 2 = sin 1 .* + c 

Example 24 

Solve -r~+ay =e x (whereat -1) 
dx 

Solution : 

The given equation is of the form — + Py = Q 

dx 

Here P = a ; Q = e x 

.-. IF = e } = e ax 
The general solution is 

V (I.F) = JQ(I.F)ox +c 
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y #* = ( e x e ax dx + c = JV a+1) * dx + 



Ja+l)x 
y eax _ £ + c 

a + 1 
Example 25 

Solve cos jc -^- + v sin x = 1 
ax 

Solution : 

The given equation can be reduced to 

dy sin x 1 dfy 

— +;y-^ii-^- =— — or -^- + v tanc = secx 

d* cos* cos* d* 

Here P = tanx ; Q = secx 

tan x dx , 

IF = e } = <? log secx = sec x 

The general solution is 

v(I.F) = j"Q(I. F))dx +c 

y sec x = [sec 2 x dx + c 
y sec x = tan x + c 

Example 26 

A bank pays interest by treating the annual interest as 
the instantaneous rate of change of the principal. A man 
invests Rs.50,000 in the bank deposit which accrues interest, 
6.5% per year compounded continuously. How much will he 
get after 10 years? (Given : e 65 =1.9155) 

Solution : 

Let P(?) denotes the amount of money in the account at time 
t. Then the differential equation governing the growth of money is 

£L =M P= o.065P => f4r = [(0-065) dt + c 
dt 100 J P J 

logJ> = 0.065? + c .-. P = e 0065f e c 
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c l = 50000 



P = Cj e 0065t 
At t = 0, P = 50000. 
(1) => 50000 = Cj e° or 

P = 50000 e° 065f 
At f = 10, P= 50000 e ° 065 x 10 = 50000 e 065 

= 50000 x (1.9155) =Rs.95,775. 
Example 27 

Solve -r- + v cos * = ^ sin 2*: 

ax 2 



-(1) 



Q = -= sin 2x 



Let sin x = t, 
then cos x dx = dt 



Solution : 

Here P = cos x 

[Pd* = fcosxd* = sin* 

I,F= e } = e &mx 
The general solution is 

y (I.F) = JQ(I.F)<& + c 

= — sin 2x. e sin * d* + c 
= [sin * cos*. e siax dx + c 
= Ue' dt + c =e'(t-l) + c 
= e* mx (sin* - 1) + c 

Example 28 

A manufacturing company has found that the cost C of 
operating and maintaining the equipment is related to the 
length m of intervals between overhauls by the equation 

m 2 4^ + 2mC = 2 and C = 4 when m = 2. Find the 
am 
relationship between C and m. 
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Solution : 

Given m 2 ^ + 2mC = 2 or 4^ + — = A 
dm dm m m 

This is a first order linear differential equation of the form 

^+ Pv = Q, where P =— ; Q = -\ 
dx mm 

[Vdm \-2-dm , 7 -, 

I.F = e 1 = e lm = e l °z m = m 2 
General solution is 

C (I.F) = fQ(I.F) dm + k where k is a constant 

Cm 2 = — j m 2 dm + k 

J m 

Cm 2 = 2m + k 

When C = A and m = 2, we have 

16 = 4 + *; =» fc=12 

.-. The relationship between C and m is 
Cm 2 = 2m+ 12 = 2(m + 6) 

Example 29 

Equipment maintenance and operating costs C are 
related to the overhaul interval x by the equation 

x 2dfL _ i 0xC _ _ 10 with C = C whenx =x ft . 
dx u u 

Find C as a function of x. 

Solution : 

dC 1A ~ m „„ dC IOC 10 



jc 



2 ^ -10jcC=-10 or 



This is a first order linear differential equation. 



P=-!°- and Q = -±°r 

x x / 

1_ 

x '^ - " 10 



\Pdx= \-^-dx=-l0\ogx = \og 
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x , 






1 



I.F = e J = e 

x'" 

General solution is 

C(I.F) = [CUFF) dx + k, where k is a constant. 



v io - J 



10 



x 



1 






dx + k or — nr 



10 

11 



f \ A 



V A J 



+ k 



when C = C x = x 



"0 

.10 



A n 



10 

11 



/ \ 



v*° y 



+ fc 



fc = 



10 
11*" 



.-. The solution is 



_C_ 

-.10 



10 

11 



.10 



( 1 1 




c 


10 


v* y 


+ 


A o 


n*; 1 J 


/ 


\ 




10 


1 


1 




11 


} 


r" 


x 11 





1) 



EXERCISE 6.4 

Solve the following differential equations 



(i) 

(ii) 

Cm) 

(iv) 

(v) 

(vi) 

(vii) 



dy 

-f- + y cot x - cosec x 

dx 

— - sin 2x - y cot x 
dx 

dy 

dx 

dy 

dx 



+ y cot x - sin 2c 

+ y cot x = 4x cosec x, if y = when x = 



— - 3y cot x = sin 2c and if y = 2 when x = ^- 
dx I 

x -f- — 3y = x 2 
dx 

dy Ixy i 

~7Z + ~rrZJ = 7Tl 2,2 given that y = when x = 1 

ax i + x (1 + x ) 
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(viii) — - y tan x - e x sec x 
dx 

(ix) logx— +— = sin 2c 
dx x 

2) A man plans to invest some amount in a small saving scheme 
with a guaranteed compound interest compounded continuously 
at the rate of 12 percent for 5 years. How much should he 
invest if he wants an amount of Rs. 25000 at the end of 5 year 
period. (e°- 6 = 0.5488) 

3) Equipment maintenance and operating cost C are related to 

2 dC 



the overhaul interval x by the equation x 2 — - — (b—l)Cx = —ba, 

dx 
where a, b are constants and C = C when x = x Q . Find the 

relationship between C andx. 



4) The change in the cost of ordering and holding C as quantity q 

■ ■ , dC C , 

is given by ~i—= a where a is a constant. 

a 1 q 

Find C as a function of q if C = C when q - q 

6.3 SECOND ORDER LINEAR DIFFERENTIAL 
EQUATIONS WITH CONSTANT COEFFICIENTS 

The general form of linear and second order differential 
equation with constant coefficients is 

a d 2 y +b <fy 

dx 2 dx 

We shall consider the cases where 

(i)/(x) = and fix) = Ke^ 

For example, 



2 + ^-r: + cy =f(x). 



d 2 y dy_ 
dx 2 dx 



(i) 3 if - 5 ^: + fy = (or) 3)T- 5y + 6 y = 



72 , 

(ii) — ^ - 4 -^ + 3v = e 5x (or) (D 2 - 4D + 3)v = e 5 * 
dx dx 
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(iii) ^T + $- - y = 7 (or) (D 2 + D - l)y = 7 
dx ax 

are second order linear differential equations. 

6.3.1 Auxiliary equations and Complementary functions 

^z +b ± 

dx 2 dx 

am 2 + bm + c = is said to be the auxiliary equation This is a 

quadratic equation in m. According to the nature of the roots nij 

and m 2 of auxiliary equation we write the complementary function 

(C.F) as follows. 



For the differential equation, a — 5- + b -f- + cy = f(x), 



Nature of roots Complementary function 


(i) Real and unequal (m l ^ m 2 ) 


. m,x „ m.x 

Ae ■ + Be 2 


(ii) Real and equal (jn l = m 2 =m say) 


(Ax + B) e" 1 " 


(iii) Complex roots (a ±/(3) 


e ax (Acos fix + Bsin fix) 


(In all the cases, A and B are arbitrary constants) 



6.3.2 Particular Integral (P.I) 

Consider (aD 2 + bD + c)y = e^ x 
Let /(D) =aD 2 + bD + c 

Case 1 : If f(k) ^ then X is not a root of the auxiliary 
equation /(m) = 0. 

Rule: P^-Jg^-J^H 

Case 2 : If /(A,) = 0, X satisfies the auxiliary equation/(m) = 0. 
Then we proceed as follows. 

(i) Let the auxiliary equation have two distinct roots m l and m 2 
and let A = m v 

Then f(m) = a(m — m t ) (m — m 2 ) = a(m — X)(m — m 2 ) 
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(ii) Let the auxiliary equation have two equal roots each equal to 
X. i.e. m l = m 2 = X. 

.-. f(m) = a ( m — X) 2 

Rule : ;. P.I = L — y e u = 1*1 e^ 

a(D-l) 2 a 2! 

6.3.3 The General solution 

The general solution of a second order linear differential 
equation is y = Complementary function (C.F) + Particular 
integral (P.I) 



Example 30 

d 2 y _ 5 di 

dx 2 dx 

Solution : 



Solve 3-^-5^-+ 2y = 



The auxiliary equation is 3m 2 - 5m + 2 = 

=> (3m -2) (m- 1) = 

2 
The roots are m, = — and m 9 = 1 (Real and distinct) 

1 3 z 

.-. The complementary function is 

C.F = Ae 3 + Be x 
The general solution is 

2 _ 

y = Ae 3 ' x +Be x 

Example 31 

Solve (16D 2 -24D + 9)v = 

Solution : 

The auxiliary equation is 16m 2 - 24m + 9 = 

(4m -3) 2 = =» m = \, \ 

4 4 

The roots are real and equal 
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TheC.F is(At + B)e 4 

The general solution is y = (Ax + B)e 

Example 32 

Solve (D 2 -6D + 25)v = 

Solution : 

The auxihary equation is m 2 - 6m + 25 = 

m= -b±^-Aac 
2a 
_ 6±n/36-100 _ 6 ±8/ _ , . 4 , 
2 _ 2 _J - 

The roots are complex and is of the form 

a + z'P with a = 3 and |3 = 4 

C.F = e ax (A cos fix + B sin pjc) 

= <? 3x (A cos 4* + B sin Ax) 
The general solution is 

y = e 3x (A cos Ax + B sin 4x) 

Example 33 

Solve -4 -5$^ +6y = e 5 * 
ax ax 

Solution : 

The auxiliary equation is m 2 - 5m + 6 = =>m = 3, 2 
Complementary function C. F = Ae 3x + Be 2 * 



— p 5x _ J_ „5x 

D 2 -5D + 6 6 



P. I = — --> — ^ — 7 e 5x = 4 e 5 



;. The general solution is 
y = C.F + P. I 

j = Ae 3x + Be 2x + ^- 
6 
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rfjc 2 rfjc 



Example 34 

Solve ^-f + 4 ^ + 4y = 2e- 

Solution : 

The auxiliary equation is m 2 + 4m + 4 = => m = -2,-2 
.-. Complementary function is C. F = (Ajc + B)e~ 2x 



P.I = —= — i 2e~ 3x 

D 2 +4D+4 




1 2e"^ = 
(-3) 2 + 4(-3) + 4 


= 2e~ 3x 


The general solution is 




y = C.F +P. I 




y = (Ax + B) e" 2x + 2e" 3j: 




Example 35 





d 2 y . dy 



Solve — y -2^ + 4v = 5 + 3e* 
ax ax 

Solution : 

The auxiliary equation is m 2 -2m + 4 = 

=* m = 2±^Hi = 2±|V3 =1±/ ^ 
C.F = e x (A cos V3* + B sin V3*) 

1 D 2 -2D + 4 4 4 

1 



PJ.-F- 



2D + 4 



3e" 



■jr 



1 3e-*= 3 -€- 



(-l) 2 -2(-l) + 4 
.•. The general solution is 

y = C.F + P. Ij + P.^ 

5 3 
y = e* (A cos -,/3Je + B sin £x) + 4 + i e~ x 
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Example 36 

Solve (4D 2 - 8D+ 3)v = e r 

Solution : 

The auxiliary equation is 4m 2 - 8m + 3 = 

m i = f> m 2= \ 

2 1 

C.F =Ae 2X +Be 2X 

_J > _ 1 



PI = ^2 



4D 2 -8D + 3 4(D-f)(D-I) 

1 i x _ _x_ i x 

4(i--i)(D-i-) e 2 - _ 4 e 2 

.-. The general solution is 
y = C.F. + P. I 

2 i. i 

y = Ae 2 '* + Be 2 " -4^ 2 

J 4 



Example 37 



2 . inn j. isi., - ^ , „-sx 



Solve : (D 2 + 10D + 25)v = ^ + e 

Z* 



Solution : 












The 


auxiliary equation 


is m 2 


+ 10m + 25 = 





=> 


(m + 5) 2 =0 










=> 


m = -5, -5 










.-. C.F = 


(Ax + B) e~ 5x 










PIi 


1 
~ D 2 +10D + 25 


1 Oar 

2 


-*( 


:t)- 


1 
10 


P.I 2 


1 


,— 5x 


1 


. e 5x 




~ D 2 +10D + 25 e 


(D+5)< 





= fy e- 5 ^ = ^- (e~ 5 *) 

.-. The general solution is 

33 



y = C.F + P. Ij + P.L^ 

"10 2 



1 2 

y = (Ax + B) e~ 5x + -^ + 4r e_5x 



Q d = 42 - 4p - 4 -j- + —j- and quantity supplied 



Example 38 

Suppose that the quantity demanded 

dp d^£ 

~dl dt 
Q s = -6 + 8/7 where/? is the price. Find the equilibrium price 

for market clearance. 

Solution : 

For market clearance, the required condition is Q^ = Q s . 
dp_ d 2 p 
dt dt 



42 - 4p - 4 ?tL + -jf- = -6 + 8p 



=> 48-12^-4^ + ^=0 

The auxiliary equation is m 2 -4m-12 = 
=> m = 6 , -2 

C.F. = Ae 6f + Be 2 ' 

RI= tf-4'D-12 ( - 48)e0 ' = ^l2 ( - 48) = 4 
.-. The general solution is 

p = C.F. +P. I 

p = Ae 6t + Be" 2t + 4 

EXERCISE 6.5 

1) Solve : 

d2 y ^dy « A n ,-s d2 y dy 
(i) — r - 10 -f + 24y = (u) — - + -f = 
ax ax ax ax 

,-n d2 y , r s d2 y a dy 

(in) — - + Ay = (iv) —+4-^4 

ax ax ax 
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2) 



3) 



Solve : 

(i) (3D 2 + 7D - 6)v = 
(iii) (3D 2 -D+ l)y = 
Solve : 



(ii) (4D 2 - 12D + 9)y = 



(i) 
(ii) 
(iii) 
(iv) 



(D 2 - 13D + 12)y = e" 2 * + 5e x 
(D 2 -5D + 6)y = e- x + 3e" 2x 
(D 2 -14D + 49)y = 3 + e 7x 
(15D 2 -2D-l)y = e 3 



4) 



Suppose that Q d = 30-5P + 2^ + — T and Q s 
the equilibrium price for market clearance. 



6 + 3P. Find 



EXERCISE 6.6 
Choose the correct answer 



1) 



The differential equation of straight lines passing through the 
origin is 



i \ dy „. dy x 

{ * )X dx= y (b) dx = 7 



t \ d y n <a\ dy 1 
(C) Jx = ° (d)X Jx = 7 



2) The degree and order of the differential equation 



d y 



- are 



3) 



dx 1 ' V dx 
(a) 2 and 1 (b) 1 and 2 (c) 2 and 2 (d) 1 and 1 

The order and degree of the differential equation 



dy 
dx 



„ d 3 y _ d 2 y dy 
-3 — j- + 7 — j- + — — = x + log x are 
<ix <ix ax 



(a) 1 and 3 (b) 3 and 1 (c) 2 and 3 (d) 3 and 2 



4) The order and degree of 



1 + 



(dy} 

dx 



J 



d y 
dx 



are 



(a) 3 and 2 



(b) 2 and 3 
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(c) 3 and 3 (d) 2 and 2 



5) The solution of x dy + y dx = is 

(a) x + y - c (b) x 2 + y 2 - c (c) xy = c (d) y = ex 

6) The solution of x dx + y dy - is 

(a) x 2 + y 2 - c (b) ~ = c (c) x 2 - y 2 - c (d) xy - c 

7) The solution of — — = e x ~ y is 

ax 

(a) e y e x - c (b) y = log ce x 

(c) y - log(e x +c) (d) e x+y - c 

8) The solution of —f- - ke~' (k is a constant) is 

dt 

k 
(a) C - — = p (b) p - ke' + c 

(c)f = log-^^ (d)? = log c /7 

9) In the differential equation (x 2 - y 2 ) dy = Ixy dx, if we make 
the subsititution y = vx then the equation is transformed into 

2 1 2 

< \ 1+V 7 fi?X /is 1 — v 7 fi?X 

(a) r dv = — (b) n 2s J v = — 

v + v x v(l + v ) X 



, x ^v dx , A , dv dx 

(C) 2 ! = (d) 2 = 

v -1 x 1+V X 

dy / 2" 2~ 

10) When y = vx the differential equation x — = y + Jx +y 

ax 
reduces to 

dv a x vdv a x 

(a) / 2 ; = ^ (b) / 2 ; = ^ 

Vv -1 x Vv +1 x 

dv r i Y vdv r i Y 

(c) / 2 ; = — (d) 1 2 = ^ 

a/v +1 x a/1-v x 

11) The solution of the equation of the type — — + Py = 0, (P is a 
function of x) is given by 

(a)ye ,Wl =c (b)yjPdx=c 

(c)xe iPdx =y (d)y = cx 
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dx 

12) The solution of the equation of the type ~r~ + Px = Q (P and Q 

are functions of y) is 

(a) y = |Q e {Vdx dy +c (b) ye SPdx = JQ e jWx Ox-K; 

(c) x e JP * = J Q e lw> ' dy +c (d) x e jp * =(Q e |PA dx +c 

1 3) The integrating factor of x — - y - e x is 

ax 

(a) logx (b) e T (c) 1 (d) ^ 

14) The integrating factor of (1 + x 2 ) — + xy - (1 + x 2 ) 3 is 

ax 



(a) TlT^ 7 (b) log (1 + x 2 ) (c) e tan " * (d) log (tan " x) 

15) The integrating factor of 1 = x 3 is 

dx x 

(a) 2 logx (b) e* 2 (c) 3 log(x 2 ) (d) x 2 

16) The complementary function of the differential equation 
(D 2 -D)y = e x is 

(a) A + B e x (b) (Ax + B)e x (c) A + Be~ x (d) (A+Bx)e* 

17) The complementary function of the differential equation 
(D 2 -2D + l)y = e 2x is 

(a) Ae x + Be~ x (b) A + Be x (c) (Ax + B)e x (d) K+Be~ x 

18) The particular integral of the differential equation 

dx dx 



2 -5 -f- + 6y = e 5x is 



5 x 5 X 5 X 

(a) ^- (b) ^- (c) 6e 5x (d) ^- 



19) The particular integral of the differential equation 

+ 9y = e 3x is 



±_L z d y , n, U; 



,2 

0~x 0*x 

(a) ^1 (b) i^l (c) J^l (d) 9e 3 * 

<i y 
20) The solution of — — -y - is 

(a) (A + B)e x (b) (Ax + B)<r* (c) Ae x + — (d) (A+Bx)e" x 

e 
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INTERPOLATION AND 
FITTING A STRAIGHT LINE 



7 



7.1 INTERPOLATION 

Interpolation is the art of reading between the lines in a table. 
It means insertion or filling up intermediate values of a function from 
a given set of values of the function. The following table represents 
the population of a town in the decennial census. 



Year : 


1910 


1920 


1930 


1940 


1950 


Population : 


12 


15 


20 


27 


39 


(in thousands) 













Then the process of finding the population for the year 1914, 
1923, 1939, 1947 etc. with the help of the above data is called 
interpolation. The process of finding the population for the year 
1955, 1960 etc. is known as extrapolation. 

The following assumptions are to be kept in mind for 
interpolation : 

(i) The value of functions should be either in increasing 
order or in decreasing order. 

(ii) The rise or fall in the values should be uniform. In other 
words that there are no sudden jumps or falls in the 
value of function during the period under consideration. 

The following methods are used in interpolation : 

1) Graphic method, 2) Algebraic method 

7.1.1 Graphic method of interpolation 

Let y = fix), then we can plot a graph between different 
values of x and corresponding values of v. From the graph we can 
find the value of y for given x. 
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Example 1 

From the following data, estimate the population for the 
year 1986 graphically. 

Year : 1960 1970 1980 1990 2000 

Population : 12 15 20 26 33 

(in thousands) 

Solution : 



34 -- 
32 -- 
30 "- 



2 28 
I 26 

•S 24 

a 

•2 22 

3 20 
& 18 



16 

14 
12 
10 



(1990,26) 

(1986,24) 




(2000, 33) 



'(I960, 12) 

h- 



— I — 
1960 



+ 



19|86 



+ 



+ 



1970 



1980 1990 
year 



2000 2010 



->X 



From the graph, it is found that the population for 1986 was 24 
thousands 

Example 2 

Using graphic method, find the value of y when x = 27, 
from the following data. 

x : 10 15 20 25 30 

y : 35 32 29 26 23 
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Solution : 



35 
34 
33 
32 
31 
30 
29 
28 
27 
26 
25 
24 
23 



((10,35) 



,(15,32) 




(20, 29) 



(25, 26) 
(27,24.8) 



(30, 23) 



-+- 



+ 



->x 



10 



15 



20 



25 27 30 



The value of y when x = 27 is 24.8 

7.1.2 Algebraic methods of interpolation 

The mathematical methods of interpolation are many. Of these 
we are going to study the following methods: 

(i) Finite differences 

(ii) Gregory-Newton's formula 

(iii) Lagrange's formula 

7.1.3 Finite differences 

Consider the arguments x Q , x v x 2 , ... x n and the entries 
y , y v y 2 , ..., y n . Here y =f(x) is a function used in interpolation. 

Let us assume that the jc-values are in the increasing order 
and equally spaced with a space-length h. 
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Then the values of x may be taken to be x Q , x Q + h, x Q + 2/z, 
... x Q + nh and the function assumes the values f(x Q ), f(x +h), 
f(x Q + 2/0, ..., f(x Q +nh) 

Forward difference operator 

For any value of x, the forward difference operator A(delta) 
is defined by 

Af(x)=f(x+h)-f(x). 

In particular, Av = A/(x Q ) =f(x Q +h) -f(x Q ) = y r y Q 

Af(x), A\f(x+h)], A \f(x+2h)], ... are the first order 
differences off(x). 

Consider A 2 f(x)= A[A{f(x) } ] 

= A\f(x+h)-f(x)] 

= A\f(x+h)] - A\f(x)] 

= \f(x+2h) -f(x+h)] - \f(x+h) -f(x)] 

= f(x+2h)-2f(x+h)+f(x). 

A 2 f(x), A 2 \f(x+h)], A 2 \f(x+2h)] ... are the second order 
differences of/(^:). 

In a similar manner, the higher order differences A 3 f(x), 
A 4 f(x),...A"f(x), ... are all defined. 

Backward difference operator 

For any value of x, the backward difference operator V(nabla) 
is defined by 

V/(x) = f(x)- f(x -h) 

In particular, Vy„ = V/(x n ) =fix n ) -fix n - h) = y-y n _ x 

Vfix), V\fix+h)], V\fix+2h)], ... are the first order differences 
of/(x). 

Consider V 2 /(x)= V[V|/(x)}] = V[f(x) -fix-h)] 

= V[fix)]-V\fix-h)] 

= fix)-2fix-h)+fix-2h) 
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V 2 /(;t), V 2 \f(x+h)], V 2 \f(x+2h)] ... are the second order 
differences of fix). 

In a similar manner the higher order backward differences 
V 3 /0O, V 4 f(x),...V n f(x), ... are all defined. 

Shifting operator 

For any value of x, the shifting operator E is defined by 
E[f(x)] =f(x+h) 
In particular, E(y ) = E[/(x )] =f(x Q +h) = y l 
Further, E 2 [f(x)] = E[E{f(x)] = E\f(x+h)] =f(x+2h) 
Similarly E 3 \f(x)] =f(x+3h) 
In general E" [f(x)] =f(x+nh) 
The relation between A and E 

We have Af(x ) = f(x+h) -fix) 
= E f{x) -f{x) 
Afix) =iE-l)fix) 
=} A =E-1 

i.e. E = 1+ A 

Results 

1) The differences of constant function are zero. 

2) If/(.*0 is a polynomial of the n th degree in x, then the n th 
difference of fix) is constant and A" +l fix) = 0. 

Example 3 

Find the missing term from the following data. 

x : 1 2 3 4 

f(x) : 100 -- 126 157 

Solution : 

Since three values of fix) are given, we assume that the 
polynomial is of degree two. 
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Hence third order differences are zeros. 
=* A 3 \f(x )] = 
or A 3 (y ) = 

.-. (E-l) 3 y =0 (A = E-1) 

(E 3 - 3E 2 + 3E - 1) y Q = 

157 -3(126) + 3^-100=0 

.-. y, = 107 

i.e. the missing term is 107 

Example 4 

Estimate the production for 1962 and 1965 from the 
following data. 

Year : 1961 1962 1963 1964 1965 1966 1967 

Production: 200 -- 260 306 -- 390 430 
(in tons) 

Solution : 

Since five values of f(x) are given, we assume that polynomial 
is of degree four. 

Hence fifth order diferences are zeros. 
••• A 5 [f(x Q )]= 
i.e. A 5 (y ) = 
.-. (E-l) 5 (y ) = 
i.e. (E 5 - 5E 4 + 10E 3 - 10E 2 + 5E - 1) v =0 

y 5 -5y 4 +10y 3 -10y 2 + 5y 1 -y =0 

390 - 5v 4 + 10(306) - 10(260) + 5y, - 200 = 

=* Vl -v 4 = -130 (1) 

Since fifth order differences are zeros, we also have 

A 5 [f(x 1 )]= 
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i.e. A 5 ^) =0 

i.e. (E- l) 5 y x = 

(E 5 -5E 4 + 10E 3 -10E 2 + 5E-l)yj =0 

y 6 -Sy s +10y 4 -10y 3 + 5y 2 -y 1 =0 

430 - 5(390) + 10y 4 - 10(306) + 5(260) -y, = 

=> 10^-^ = 3280 (2) 

By solving the equations (1) and (2) we get, 

Vj = 220andv 4 = 350 

.-. The productions for 1962 and 1965 are 220 tons and 350 tons 
respectively. 

7.1.4 Derivation of Gregory - Newton's forward formula 

Let the function y =f(x) be a polynomial of degree n which 
assumes (n+1) values f(x^),f(x 1 ),f(x 2 )...f(x^, where^ , x v x 2 , ... 
x n are in the increasing order and are equally spaced. 

Let Xj -x =x 2 — x l = x 3 -x 2 = ... =x n -x nl = h (a positive 
quantity) 

Here f(x Q ) =y Q , /(x x ) =y v ...f{x n )=y n 

Now fix) can be written as, 

f(x) = a + a x (x - x Q ) + a 2 (x-x )(x-x x ) + ... 

+a n (x-x ) (x-x x )... (x-x nl ) (1) 

When x = x Q , (1) implies 

f(x Q ) =a or a = y Q 

When x = x v (1) => 

f(x x ) =a Q + a l (x l -x ) 

i.e. y 1 =y Q + a l h 

yi-y _ _ A y 



•• a l = h 

When x = x 2 , (1) => 
f(x 2 ) =a Q + a l (x 2 - 


-x )+a 2 (x 2 - 


- x ) (x 2 



x x ) 
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y 2 =y a + I ^ L m + a 2 (2h)(h) 

2h 2 a 2 =y 2 -y Q - 2Ay 

= y 2 -yo~ 2 0vi - y ) 

= y 2 -2 yi +y Q = A 2 y Q 



2 2! h 2 
In the same way we can obtain 

. - ** a = * 

3 3! h 3 ' 4 4!/ ? 4 """"" nlh" 
substituting the values of a Q , a v ..., a n in (1) we get 

h {X Xq)+ 2! h 



_ A y _ A y _ A" y Q 

% r* I I S ' A ,11 7 4 ' * * * ' n 



f(x) =y n + -j^- (x - x ) + tttyC* - x ) (x - x x ) + 



A" y 
+ —rr^( x ~ x o) ( x ~ x v> ••• ( x ~ x n-0 ( 2 ) 

nlh 

x — x 
Denoting — r^by u, we get 
h 

JC JC f. — flli 

x — x i = (x - x ) - (x i - x ) = hu - h = h{u—\) 

x - x 2 = (x - x ) - (x 2 - x ) = hu - 2h = h(u-2) 

x - x 3 = h (u - 3) 
In general 

x-x nA =h{u-(n-l)} 
Thus (2) becomes, 

ft \ u A u(u —1) , 

f( x )=yo + T\ A yo + —2T- A \ + - 



w(w-l)(w-2)...(w-n-l) A „ 

+ j A "v 

n ! 

where u = — t—^-. This is the Gregory-Newton's forward 
h 
formula. 
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Example 5 

Find y when jc = 0.2 given that 

x : 1 2 3 4 

y : 176 185 194 202 212 

Solution : 

0.2 lies in the first interval (x , x x ) i.e. (0, 1). So we can use 
Gregory-Newton's forward interpolation formula. Since five values 
are given, the interpolation formula is 

u A w(w-l) ., w(w-l)(w-2) 

y = y + 11 Ay Q +—^A 2 y Q + ^ A\ 

u(u-l)(u-2)(u-3) .. , x-x n 

+ — A 4 y„ where u= — : — 

4! Jo h 

Here h = 1, x = and x = 0.2 
0.2-0 no 

.-. U = : = 0.2 

The forward difference table : 

x y Ay A 2 y A 3 y A 4 y 



176 



9 



1 185 ml 

2 194 -1 4 

3 202 2 

4 212 

, y =176+ 02 (9) + ^Z1) (0 ) 

(0.2X0.2 -1X0.2 -2) (0.2X0.2- 1X0.2 -2X0-2-3) 

= 176 + 1.8-0.048-0.1344 
= 177.6176 
i.e. when jc = 0.2, v= 177.6176 
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Example 6 

If j 75 = 2459, j 80 = 2018, j 85 = 1180 and 
j 90 = 402 findj 82 . 

Solution : 

We can write the given data as follows: 

x : 75 80 85 90 

y : 2459 2018 1180 402 

82 lies in the interval (80, 85). So we can use Gregory- 
Newton's forward interpolation formula. Since four values are given, 
the interpolation formula is 

u w(w-l) u(u-l)(u-2) 

y = yo + T\ Ay ° + 2! y ° 3l y ° 

where u = 



h 
Here h = 5, x = 75 x = 82 

82-75 7 . . 
,.,= ^ = -=1.4 

The forward difference table : 

x y Ay A 2 y A 3 y 



457 



75 


2459 -441 




80 


2018 _ 83g 


-397 


85 


1180 _ 778 


60 


90 


402 




y 


= 2459 +^y 


(-441) + 



^^(-397, 

+ 1.4(1.4-l) (1 .4-2) (457) 



= 2459-617.4- 111.6-25.592 
= 1704.408 when x = 82 
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Example 7 

From the following data calculate the value of e 1 J5 
x : 1.7 1.8 1.9 2.0 2.1 

e* : 5.474 6.050 6.686 7.389 8.166 

Solution : 

Since five values are given, the interpolation formula is 
u . w(w-l) ., u(u-Y)(u-2) 



u(u-l)(u-2)(u-3) 

4! A} 



v o 



where u = 

n 

Here ^ = 0.1, * = 1.7 % = 1.75 
1.75-1.7 _ 0.05 n c 

•'• M_ o.i ~ o.i _u - 3 

The forward difference table : 

x y Ay A 2 y A 3 y A 4 v 

1.7 5.474 

0.576 

1.8 6.050 0.060 „„„„ 

0.636 0.007 

1.9 6.686 0.067 „ AAa 

0.703 °- 007 

2.0 7.389 0.074 

0.777 

2.1 8.166 

,y = 5.474 + M(0.576) + ^^ (0.06) 
+ 0.5(0.5- 3 l)(0.5-2) (0()07) 

= 5.474 + 0.288 - 0.0075 + 0.0004375 
:.y =5.7549375 when* =1.75 
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Example 8 

From the data, find the number of students whose height 
is between 80cm. and 90cm. 

Height in cms x : 40-60 60-80 80-100 100-120 120-140 

No. of students y : 250 120 100 70 50 

Solution : 

The difference table 

x y Ay A 2 y A 3 y A 4 y 

Below 60 250 

Below 80 370 12 ° -20 

100 10 

Below 100 470 -30 tn 20 

70 10 

Below 120 540 -20 

Below 140 590 5 ° 

Let us calculate the number of students whose height is less 
than 90cm. 

Here ,=90 u = ^ = ^ = 1 .5 

y(90) = 250 +(1.5)(120) + (L5)( ^f~ l) (-20) 
{ (1.5)(1.5-l)(1.5-2) (1.5X1-5 -1X1.5 -2)(1.5-3) 

= 250 + 180 - 7.5 + 0.625 + 0.46875 
= 423.59 ~ 424 
Therefore number of students whose height is between 
80cm. and 90cm. is y(90) - y(80) 
i.e. 424-370 = 54. 

Example 9 

Find the number of men getting wages between Rs.30 
and Rs.35 from the following table 
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Wages 


X 


: 20-30 30-40 


40-50 


50-60 


No. of men 


y 


: 9 




30 


35 


42 


Solution : 














The difference table 










X 




y 


Ay 




A 2 y 


A 3 y 


Under 30 




9 


1f\ 








Under 40 
Under 50 




39 

74 


30 

35 

A ^ 




5 
7 


2 


Under 60 




116 


42 









Let us calculate the number of men whose wages is less 
than Rs.35. 

t- oc ^ — ^o 35 — 30 nc 

For * = 35 , w = — : — = — 77; — =0.5 

h 10 

By Newton's forward formula, 

(>xk\ n. (°- 5 ),am . (0-5X0-5 -D ,^ 
y(35) = 9+— j— (30) + — (5) 

, (0.5X0.5 -1X0.5- 2) ^ 
3! l J 

= 9+ 15-0.6 + 0.1 

= 24 (approximately) 

Therefore number of men getting wages between 

Rs.30 and Rs.35 is y(35) - y(30) i.e. 24-9=15. 

7.1.5 Gregory-Newton's backward formula 

Let the function y = fix) be a polynomial of degree n which 
assumes (n+1) values f{x^),f{x^,f{x^), ...,f(x n ) where x Q , x v x 2 , 
..., x n are in the increasing order and are equally spaced. 

LetXj - %„ = % 2 — Xy = x 3 — x 2 = ... x n — jc j = h (a positive 
quantity) 
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Here/(x) can be written as 

f(x) =a Q + a^x-xj + a 2 (x-x n ) (x-x nl ) + ... 

+ a n (x-x n ) (x-x nl ) ... (x-Xj) (1) 

Whenx = x n , (1) => 

fix ) = a„ or a =y 

■I v n' J n 

When x = x n _ 1 , (1) => 

/(■Vi)= a o + «iCvr*„) 
or j„_! =y n + a l (-h) 



or 
Whe 


fl = y, y„-i ^ fl 

n x = x„_ 2 , (!)=> 


vy„ 



/(-*„_ 2 ) = A + «1 ( X «-2 - *„) + ^2 ( X n-2 ~ X r) ( X n-2 ~ *„-l) 

y„- 2 = y„ + ^ (- 2/? ) + « 2 (- 2/? ) HO 

2^ 2 =^- 2 -y„) + 2Vj„ 

- y n _ y + 2(y — y .) 

J n— 2 J n— 1 *^n *^n 

• (3 = — 

2 2lh 2 
In the same way we can obtain 

V 3 v V 4 v V"y 

•'• f(X) =y " + ^t {X ~ X " ] + \^ {X ~ X n^ X - X n-l) + - 

+ ^^ (x-x n ) (x-x n _ x ) ... (x-x x ) (2) 

n ! 

x — x 
Further, denoting -by u, we get 

x-x = h 

n u 

51 



x-x , = (x-x ) (x -x ,) = hu + h = h(u+l) 

x-x , = (jc-jc ) 6c -jc ) = hu + 2h = h(u+2) 

x-jc n3 = h(u+3) 
In general 

x_x „^ = h(u+k) 
Thus (2) becomes, 

f( x ) = y + — Vy + V 2 y + ... 

«(w + l)...{«+(n-l)} x-x„ 

+ V"y where w = - 

n\ n h 

This is the Gregory-Newton's backward formula. 

Example 10 

Using Gregory-Newton's formula estimate the 
population of town for the year 1995. 

Year x : 1961 1971 1981 1991 2001 

Population y : 46 66 81 93 101 

(in thousands) 

Solution : 

1995 lies in the interval (1991, 2001). Hence we can use 
Gregory-Newton's backward interpolation formula. Since five 
values are given, the interpolation formula is 

M w(w+l) _, u(u + l)(u + 2) , 

y =y 4 +jiVy 4 + —2^v 2 y 4 + 3, v 3 y 4 

u(u + l)(u + 2)(u+3) —. . x-x, 

+ — , , VV where u = — —=- 

4! M h 

Here h = 10, x 4 = 2001 x = 1995 

1995-2001 n( . 
:. u = ttj = -0.6 
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The backward difference table : 

x y Vy V 2 y V 3 y V 4 y 

1961 46 

1971 66 20 -5 

1981 81 15 -3 a 



1991 93 ! ? -4 



1 



2001 101 



8 



(-0.6) (-0.6X-0.6+1) 
... y =101+1^(8)+^ ^ -(-4) 

(-0.6)(-0.6+l)(-0.6+2) 

3' 
(-0.6X-0-6 + 1X-0.6+ 2X-0.6+ 3) 

4~! 
= 101-4.8+0.48+0.056+0.1008 .-. y = 96.8368 
i.e. the population for the year 1995 is 96.837 thousands. 
Example 11 

From the following table, estimate the premium for a 
policy maturing at the age of 58 

Age x : 40 45 50 55 60 

Premium y : 114.84 96.16 83.32 74.48 68.48 

Solution : 

Since five values are given, the interpolation formula is 



V 4 y 





y 


= 3^4+ IT 


Vy 4 +...+- 


4! 




where 


u 


58-60 

5 


= -0.4 






The backward difference table : 






X 




y 


Vy 


V 2 y 


V 3 y 


40 




114.84 


-18 68 






45 




96.16 


-12.84 


5.84 


-1.84 


50 




83.32 


-8 84 


4.00 


-1.16 


55 




74.48 


-6.00 


2.84 




60 




68.48 









0.68 
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... , =68 .4 8+ <-M ( -6) + ( -°- 4 f- 6) (2.84) 

+ (-0.4)(0.6)(1.6) (116) + (-0.4X0.6)(1.6)(2.6) (068) 

= 68.48 + 2.4 - 0.3408 + 0.07424 - 0.028288 
y =70.5851052 i.e. y~ 70.59 
.-. Premium for a policy maturing at the age of 58 is 70.59 

Example 12 

From the following data, find y when x = 4.5 
jc : 1 2 3 4 5 

y : 1 8 27 64 125 

Solution : 

Since five values are given, the interpolation formula is 

v =3. 4+ fiv 3 , 4+ ... + " ( " +1)( " 4 H ; 2X " +3 > vx 

V — V 

where w = 



h 

Here w = — —. — = -0.5 

The backward difference table : 

jc j Vy V 2 j V 3 j V 4 j 



1 1 



7 



28 19 12 6 

3 27 „ 18 , 

51 o 

4 64 ,. 24 

ol 

5 125 

(-0^ (-05X05^ i (-0.5)(0.5)(1.5) ^ 
v = 125+ — j — (61)+ (24) + (6) 

y = 91.125 when x = 4.5 
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7.1.6 Lagrange's formula 

Let the function y =f(x) be a polynomial of degree n which 
assumes (n + 1) values /(* ),/(*j),/(.x 2 ) ...f(x n ) corresponding to 
the arguments x Q , x v x 2 , ... x (not necessarily equally spaced). 

Here/(x ) =y ,f(x l ) =y v ...J(x n )=y n . 

Then the Lagrange's formula is 

(x-x 1 )(x-x 2 )...(x-x n ) 
JW -yo(x -x l Xx -x 2 )...(x -x n ) 

(x-x Q )(x-x 2 )...(x-x n ) 
\%1 ~ Xq)\X\ ~ x 2 )...\x l — X n ) 

(x-x )(x-x l )...(x-x n _ l ) 
+ "" +y "(x n -x )(x n -x 1 )...(x n -x n _ l ) 

Example 13 

Using Lagrange's formula find the value of y when 
x = 42 from the following table 

x : 40 50 60 70 

y : 31 73 124 159 

Solution : 

By data we have 

x Q = 40, x 1 = 50, x 2 = 60, x 3 = 70 and x = 42 
y = 31, yi = 13,y 2 =124,y 3 =159 
Using Lagrange's formula, we get 

(x-x 1 )(x-x 2 )(x-x i ) 

V -^0 — -^i A -^0 — -^2 A -^0 _ "*3 / 
I \ *^n / \ 9 / \ ~\ ) 

\X\ — X^)\X\ — X 2 )\Xy — x^) 

(x - x )(x - x^ix - x 3 ) 
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(x-x )(x-x l )(x-x 2 ) 
(-8X-18X-28) „„ (2)(-18)(-28) 



y(42) - 31 (_io)(_20)(-30) + 73 (10)(-10)(-20) 

(2)(-8)(-28) (2)(-8)(-18) 

+ 124 (20X1 OX- 10) +159 (30X20X10) 

= 20.832 + 36.792 - 27.776 + 7.632 
y = 37.48 

Example 14 

Using Lagrange's formula find y when x = 4 from the 



following table 






x : 




3 5 6 8 


y : 




276 460 414 343 110 


Solution : 






Given 






x = 


o, 


x x = 3, x 2 = 5, x 3 = 6, x 4 = 8 and* = 4 



v = 276, y, = 460, y 2 = 414, v 3 = 343, y 4 = 110 
Using Lagrange's formula 

( JC - JCj )( JC - X, )(.*; - JC3 )( JC - .x; 4 ) 

v-^0 _ -^1/V-^O _ ^2/V-^O _ "^3 A-^0 — -^4 / 

(x- x )(x - x 2 )(x - x 3 )(x -x 4 ) 
\X\ — ^o A-^i — %2 A-^i — -^3 A -^i — -^4/ 

(x- x Q )(x- x x )(x- Xj)(x- x 4 ) 
v -^2 "^0 A -^2 -^1 A -^2 ^3 A -^2 "^4 / 

(*-*„)(*-*!)(*- * 2 )(*-jc 4 ) 
v -^3 -^0 A -^3 -^1 A ^3 %2 A-^3 -^4 / 

(jc-jc )(*-* 1 )(*-* 2 )(*-* 3 ) 

V -^4 — -^0 A "*4 — -^1 A "*4 — -^2 A -*4 — -^3 ) 

56 



„, (1)(-1)(-2)M) (4)(-l)(-2)M) 

Z/b (-3)(-5)(-6)(-8) + 4bU (3)(-2)(-3)(-5) 

+ 414J^l=t + 343- (4X1)( " 1)( " 4) 



+ 110 



(5)(2)(-l)(-3)- rj ' tJ (6)(3)(l)(-2) 
(4)(l)(-l)(-2) 



(8)(5)(3)(2) 

= -3.066 + 163.555 + 441.6 - 152.44 + 3.666 
y =453.311 



Example 15 




Using Lagrange's formula find j(ll) 
following table 


x : 6 


7 10 12 


y : 13 


14 15 17 


Solution : 




Given 





from the 



x Q =6, x l = 7, x 2 = 10, x 3 = 12 and x = 11 
y = 13, yi= 14,v 2 = 15,v 3 = 17 
Using Lagrange's formula 

_ (4)(1)(-1) ^ ^ (5)(1)(-1) 



'(-l)(^)(-6) T ^(l)(-3)(-5) 

(5X4X-1) (5X4X1) 

+ 1:> (4)(3)(-2) +1/ (6X5)(2) 
= 2.1666 - 4.6666 + 12.5 + 5.6666 
y = 15.6666 

EXERCISE 7.1 

1) Using Graphic method, find the value of y when x = 42, from 
the following data. 

x : 20 30 40 50 

y : 51 43 34 24 
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2) The population of a town is as follows. 

Year x: 1940 1950 1960 1970 1980 1990 

Population y : 20 24 29 36 46 50 

(in lakhs) 

Estimate the population for the year 1976 graphically 

3) From the following data, find f(3) 

x : 1 2 3 4 5 

f(x) : 2 5 - 14 32 

4) Find the missing term from the following data. 

x : 5 10 15 20 25 

y : 7 11 14 - 24 32 

5) From the following data estimate the export for the year 2000 
Year x : 1999 2000 2001 2002 2003 
Export y : 443 - 369 397 467 

(in tons) 

6) Using Gregory-Newton's formula, find y when x = 145 given 
that 

x : 140 150 160 170 180 

y : 46 66 81 93 101 

7) Using Gregory-Newton's formula, find y(8) from the following 
data. 

x : 5 10 15 20 25 

y : 7 11 14 18 24 32 

8) Using Gregory-Newton's formula, calculate the population for 
the year 1975 

Year : 1961 1971 1981 1991 2001 

Population : 98572 132285 168076 198690 246050 

9) From the following data find the area of a circle of diameter 
96 by using Gregory-Newton's formula 

Diameter x : 80 85 90 95 100 

Area y : 5026 5674 6362 7088 7854 

58 



10) Using Gregory-Newton's formula, find y when x - 85 
x : 50 60 70 80 90 100 

y : 184 204 226 250 276 304 

11) Using Gregory-Newton's formula, find y(22.4) 
x : 19 20 21 22 23 

y : 91 100 110 120 131 

12) From the following data findy(25) by using Lagrange's formula 
x : 20 30 40 50 

y : 512 439 346 243 

13) If f{0) = 5, /(l) = 6, /(3) = 50, /(4) = 105, find /(2) by using 
Lagrange's formula 

14) Apply Lagrange's formula to find y when x - 5 given that 
x : 1 2 3 4 7 

y : 2 4 8 16 128 



7.2 FITTING A STRAIGHT LINE 

A commonly occurring problem in many fields is the necessity 
of studying the relationship between two (or more) variables. 

For example the weight of a baby is related to its age ; the 
price of a commodity is related to its demand ; the maintenance 
cost of a car is related to its age. 

7.2.1 Scatter diagram 

This is the simplest method by which we can represent 
diagramatically a bivariate data. y > 
Suppose x and y denote 
respectively the age and weight 
of an adult male, then consider 
a sample of n individuals with 
ages x v x v x y ... x n and 
the corresponding weights as 
y v y v y v ... y n . Plot the points 




Fig. 7.1 



59 



(x 1 ,;y 1 ), (x 2 ,y 2 ),(x y y 3 ), ... (x n , y n ) on a rectangular co-ordinate 
system. The resulting set of points in a graph is called a scatter 
diagram. 

From the scatter diagram it is often possible to visualize a 
smooth curve approximating the data. Such a curve is called an 
approximating curve. In the above figure, the data appears to be 
well approximated by a straight line and we say that a linear 
relationship exists between the two variables. 

7.2.2 Principle of least squares 

Generally more than one curve of a given type will appear to 
fit a set of data. In constructing lines it is necessary to agree on a 
definition of a "best fitting line". 

Consider the data points 
(x v yj), (x 2 , y 2 ), (x 3 , y 3 ), ... 
(x n , y n ). For a given value of x, 
say jCj, in general there will be a 
difference between the value y, 

and the corresponding value as 

determined from the curve C 
(in Fig. 7.2) 




(•Wn) 



Fig. 7.2 



We denote this difference by d v which is referred to as a 
deviation or error. Here d x may be positive, negative or zero. 
Similarly corresponding to the values x 2 , x v ... x n we obtain the 
deviations d n , d-., ... d . 

A measure of the "goodness of fit" of the curve to the set of 
data is provided by the quantities d 2 , d 2 2 , ... d 2 . 

Of all the curves approximating a given set of data points, the 
curve having the property thatJj 2 +d 2 2 + d 2 +...+ d 2 is a minimum 
is the best fitting curve. If the approximating curve is a straight line 
then such a line is called the "line of best fit". 
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7.2.3 Derivation of normal equations by the principle of least 
squares. 

Let us consider the fitting of a straight line 

y = ax + b (1) 

to set of n points (x v y { ), (x 2 , y 2 ), ... (x n , y n ). 

For the different values 
of a and b equation (1) 
represents a family of straight 
lines. Our aim is to determine 
a and b so that the line (1) is 
the line of best fit. ■■%( x .,ax.+b) 



o 



P,tw,) 




M 
Fig. 7.3 



Now a and b are 
determined by applying 
principle of least squares. 

Let P. (x., y.) be any 
point in the scatter diagram. 
Draw PM perpendicular to x-axis meeting the line (1) in H. The 

jc-coordinate of H. is x.. The ordinate of H is ax . + b. 

it it 

PH. = PM - HM 

tit i 

= y j -(ax j +b) is the deviation for y.. 

According to the principle of least squares, we have to 
find a and b so that 

n 2 H 2 

E = Z PH. = Z [v. — (ax . + b)] is minimum. 

For maxima or minima, the partial derivatives of E with respect 
to a and b should vanish separately. 
3E 



da 



= 



-2Lx i [y i -(ax. + b)] = 



a 6 x 2 + b 6 x. = Ox y. 

3E 



-(2) 



db 



= 



2Z\y i -(ax i + b)] = 
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i.e., Ey. — dLx i — nb = 
n n 

=> a Ox +nb= 6 v. (3) 

(2) and (3) are known as the normal equations. Solving the 
normal equations we get a and b. 

Note 

The normal equations for the line of best fit of the form 
y = a + bx are 

na + bT.x. = Ey. 

i j i 

aLx. + bT,x. 2 = T,x.y. 

Example 16 

Fit a straight line to the following 

Ejc = 10, £y = 19, Ejc 2 = 30, T.xy = 53 and n = 5. 

Solution : 

The line of best fit is y = ax + b 

Ev = dLx + nb 

Zxy = aLx 2 + bZx 

=* I0a + 5b=l9 (1) 

30a + 10^ = 53 (2) 

Solving (1) and (2) we get, a = 1.5 and b = 0.8 

The line of best fit is y = 1 ,5x + 0.8 

Example 17 

In a straight line of best fit find x -intercept when 
Zx = 10, Ej=16.9, Ex 2 = 30, Exy = 47.4 and n = 7. 

Solution : 

The line of best fit is v = ax + b 
The normal equations are 

Ev = aLx + nb 
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Lxy = dLx 1 + bLx 






=> 10a + lb =16.9 




— (1) 


30a + 10b = 47 .4 




—(2) 


Solving (1) and (2) we get, 






a = 1 .48 and & = 


0.3 




The line of best fit is y 


= 1.48* + 0.3 


TVip v intprr-pnt nf tVip linp 


nf hpct fi 


t - 0.3 



Example 18 

Fit a straight line for the following data. 



x : 1 2 


3 4 


y : 1 1 3 


4 6 


Solution : 




The line of best fit is y = ax + b 




The normal equations are 




dLx + nb = Lv 


(1) 


aLx 2 + ££* = Lry 


(2) 


Now from the data 





x 2 xy 






1 








1 


1 


1 


1 


2 


3 


4 


6 


3 


4 


9 


12 


4 


6 


16 


24 



10 15 30 43 



By substituting these values in (1) and (2) we get, 

10a + 5^=15 (3) 

30^+10^ = 43 (4) 
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Solving (3) and (4) we get, a = 1.3 and b = 0.4 
The line of best fit is y = 1.3^: + 0.4. 

Example 19 

Fit a straight line to the following data: 



x : 


4 


8 


12 


16 


20 


24 


y : 


7 


9 


13 


17 


21 


25 


Solution : 















Take the origin at — r — =14 



Let w. = 



x -14 



Here n = 6 



The line of best fit is y = au + b 
The normal equations are dLu + nb = Lv 

aEw 2 + bEw = Ewv 



H 



HV 



4 


7 


-5 


25 


-35 


8 


9 


-3 


9 


-27 


12 


13 


-1 


1 


-13 


16 


17 


1 


1 


17 


20 


21 


3 


9 


63 


24 


25 


5 


25 


125 



Total 



92 







70 



130 



-(1) 
-(2) 



On substituting the values in the normal equation (1) and (2) 
a= 1.86 and b= 15.33 

' x 



The line of best fit is y = 1.86 



r^J+15.33 



0.93* +2.31 



Example 20 

Fit a straight line to the following data, 
x : 100 200 300 400 

y : 90.2 92.3 94.2 96.3 
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500 
98.2 



600 
100.3 



Solution : 
Let u 



x t -350 
50 



and v ; = y ; - 94.2 Here n = 6. 



The line of best fit is v = au + b 



•mal equations are 


dLu + 


nb = 


= v 


(1) 






dLu 2 4 


- bLu = 


= Zuv 


(2) 


X 


y 


u 


V 


u 2 


uv 


100 


90.2 


-5 


-4 


25 


20 


200 


92.3 


-3 


-1.9 


9 


5.7 


300 


94.2 


-1 





1 





400 


96.3 


1 


2.1 


1 


2.1 


500 


98.2 


3 


4 


9 


12 


600 


100.3 


5 


6.1 


25 


30.5 


Total 







63 


70 


70.3 



Substituting the values in (1) and (2) we get 
a= 1.0043 and &= 1.05 
The line of best fit is v = 1 .0043 u + 1 .05 
=> y = 0.02x +88.25 

EXERCISE 7.2 

Define a scatter diagram. 

State the principle of least squares. 

Fit the line of best fit if Ix = 75, ly = 115, Ix 2 = 1375, 
Zxy = 1875, and n - 6. 

In a line of best fit find the slope and the y intercept if 

Ix = 10, ly = 25, Ix 2 = 30, Ixy = 90 and n = 5. 

Fit a straight line y - ax + b to the following data by the 

method of least squares. 

x 13 6 8 

y 1 3 2 5 4 
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1) 

2) 
3) 

4) 

5) 



6) A group of 5 students took tests before and after training and 
obtained the following scores. 

Scores before training 3 4 4 6 8 

Scores after training 4 5 6 8 10 

Find by the method of least squares the straight line of best fit 

7) By the method of least squares find the best fitting straight 
line to the data given below: 

x : 100 120 140 160 180 200 

y : 0.45 0.55 0.60 0.70 0.80 0.85 

8) Fit a straight line to the data given below. Also estimate the 
value y at x - 3.5 

x : 1 2 3 4 

y : 1 1.8 3.3 4.5 6.3 

9) Find by the method of least squares, the line of best fit for the 
following data. 

Depth of water applied x : 12 24 36 48 

(in cm) 
Average yield y : 35 55 65 80 90 

(tons / acre) 

10) The following data show the advertising expenses (expressed 
as a percentage of total expenses) and the net operating profits 
(expressed as a percentage of total sales) in a random sample 
of six drug stores. 

Advertising expenses 0.4 1.0 1.3 1.5 2.0 2.8 
Net operating profits 1.90 2.8 2.9 3.6 4.3 5.4 
Fit a line of best fit. 

1 1) The following data is the number of hours which ten students 
studied for English and the scores obtained by them in the 
examinations. 

Hours studied x : 4 9 10 12 14 22 

Test score y : 31 58 65 68 73 91 

(i) Fit a straight line y - ax + b 

(ii) Predict the score of the student who studied for 17 hours. 
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EXERCISE 7.3 



Choose the correct answer 



1) 


A/(x) = 
(a)/(x+/*) 






(b)/(jc)-/(jc+A) 




(c)f(x+h)-f(x) 




(d)/(x)-/(x-fc) 


2) 


E 2 /(x) = 










(a)/(x+/*) 




(b)f(x+2h) 


(c)f(2h) (d)/(2x) 


3) 


E = 










(a) 1+A 




(b) 1 - A 


(c) V+ 1 (d) V-l 


4) 


Vf(x+3h) = 
(a)f(x+2h) 

(c)f(x+3h) 






(b)f(x+3h)-f(x+2h) 
(d)f(x+2h)-f(x-3h) 


5) 


When h = 1 


,A( 


x 2 ) = 






(a) 2x 




(b) 2x - 1 


(c) 2x+l (d) 1 


6) 


The normal 


equations for estimating a and b so that 



line y - ax + b may be the line of best fit are 

(a) aLx 2 + bLx. — Zx.y. and aLx. + nb — Zy. 

(b) aLx. + &Zx 2 = Zx.y. and aLx 2 + nb - Zy. 

(c) a£x. + nb — Zx.y. and aLx 1 + Z?Zx. = Zy. 

(d) aLx 2 + nb - Lx.y. and aLx. + bLx. - Ly. 

7) In a line of best fit y = 5.8 (x-1994) + 41.6 the value of y 
when x - 1997 is 

(a) 50 (b) 54 (c) 59 (d) 60 

8) Five data relating to x and y are to be fit in a straight line. It is 
found that Lx = and Ly - 15. Then the y-intercept of the line 
of best fit is, 

(a)l (b)2 (c)3 (d)4 
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9) The normal equations of fitting a straight line y - ax + b are 
10a +5b = 15 and 30a + 10b = 43. The slope of the line of best 
fit is 

(a) 1.2 (b) 1.3 (c) 13 (d) 12 

10) The normal equations obtained in fitting a straight line 
y - ax + b by the method of least squares over n points (x, y) 
are 4 = 4a + b and Zxy = 120a + 24b. Then n - 

(a) 30 (b) 5 (c) 6 (d) 4 
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PROBABILITY 
DISTRIBUTIONS 



8 



8.1 RANDOM VARIABLE AND 
PROBABILITY FUNCTION 

Random variable 

A random variable is a real valued function defined on a 
sample space S and taking values in (-00 , 00) 

8.1.1 Discrete Random Variable 

A random variable X is said to be discrete if it assumes only 
a finite or an infinite but countable number of values. 

Examples 

(i) Consider the experiment of tossing a coin twice. The sample 
points of this experiment are^ = (H, H), s 2 = (H, T), s 3 = (T, 
H) and s 4 = (T, T). 

Random variable X denotes the number of heads obtained in 
the two tosses. 

Then X( Sl ) = 2 X(s 2 ) = 1 

X(s 3 ) = 1 X(s 4 ) = 

R x = {0,1,2} 

where s is the typical element of the sample space,X(5) represents 
the real number which the random variable X associates with the 
outcome s. 

R x , the set of all possible values of X, is called the range 
space X 

(ii) Consider the experiment of rolling a pair of fair dice once. 
Then sample space 
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S = {(1,1) (1,2) (1,6) 



(6,1) (6, 2) (6,6)} 

Let the random variable X denote the sum of the scores on 
the two dice. Then R^ = {2,3,4, ,12}. 

(iii) Consider the experiment of tossing of 3 coins simultaneously. 

Let the random variable X be Number of heads obtained in 
this experiment. 

Then 

S = {HHH, HHT, HTT, TTT, TTH, THH, HTH, THT} 

R x = {0,1,2,3} 

(iv) Suppose a random experiment consists of throwing 4 coins 
and recording the number of heads. 

Then ^={0,1,2,3,4} 

The number of printing mistakes in each page of a book and 
the number of telephone calls received by the telephone operator 
of a firm, are some other examples of discrete random variable. 

8.1.2 Probability function and Probability distribution of a 
Discrete random variable 

Let X be a discrete random variable assuming values 
x v x 2 , x y .. If there exists a function/? denoted by p(x) = P[X =x.] 
such that 

(i) p(x)>0 forz = 1,2, ... 

(ii) £ p(x) = 1 

i 

then p is called as the probability function or probability mass 
function (p.m.f) of X. 
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The collection of all pairs (x., p(x)) is called the probability 
distribution of X. 

Example 1 

Consider the experiment of tossing two coins. Let X 
be a random variable denoting the number of heads obtained. 





X : 





1 


2 




p(.r.) : 


1 
4 


1 

2 


1 
4 




Is p(.r.) 


a p.m.f ? 




Solution : 








(i) 


p(x) > I 


for all i 




(ii) 


Lp(x) -- 


--p(0) 


+ />(!) + 


P(2) 






-^ 


h\ 


= 1 



Hence p(^ .) is a p.m.f. 

Example 2 

Consider the discrete random variable X as the sum of 
the numbers that appear, when a pair of dice is thrown. The 
probability distribution of X is 

X: 2 3 4567 89 10 11 12 

P&i) '• 36 36 36 36 36 36 36 36 36 36 36 
Is p(jc.) a p.m.f? 

Solution : 

p(x) > for all / 

(ii) Zp(x) = 36 + 36 + 36 + + 36 = 1 

Hence p(x) is a p.m.f. 
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8.1.3 Cumulative Distribution function : (c.d.f.) 

Let X be a discrete random variable. The function F(x) is 
said to be the cumulative distribution function (c.d.f.) of the random 
variable X if 

F(x) = P(X<x) 

= E p(x { ) where the sum is taken over i 

i 

such thatx <x. 

i — 

Remark : P(a < X < b) = F(b) - F(a) 

Example 3 

A random variable X has the following probability 



function : 






Values of X, x : -2 -1 


1 


2 3 


p(x): 0.1 k 


0.2 2k 


0.3 k 


(i) Find the value of k 






(ii) Construct the c.d.f. of X 






Solution : 







(i) Since Lpix) = 1, 

i 

p(-2) +p(-l) +p(0) +p(l) +p(2) +p(3) = 1 

0.1 +k + 0.2 + 2k + 03+k= 1 

0.6 + 4k = l^4k= 1-0.6 

.4 
4^ = 0.4 .:k= 47 = 0.1 

Hence the given probability function becomes, 
x : -2-10123 

p(x) : 0.1 0.1 0.2 0.2 0.3 0.1 

(ii) Cumulative distribution function F(x) = P(X < x) 
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X 


F(x) = P(X<;c) 


-2 


F(-2) = P(X < -2 ) =0.1 


-1 


F(-l) = P(X < -1 ) = P(X = -2) + P(X = -1) 




= 0.1 + 0.1=0.2 





F(0) =P(X<0) =P(X=-2) + P(X=-l)+P(X = 0) 




= 0.1+0.1+0.2 =0.4 


1 


F(l) =P(X<1) =0.6 


2 


F(2) =P(X<2) = 0.9 


3 


F(3) =P(X<3) =1 



F(x) = if x < -2 










= .lif 


—2 <jc < — 1 










= .2 if 


-1 <*<0 










= .4 if 


0<x< 1 










= .6 if 


l<;t<2 










= 0.9, 


if2<x<3 










= 1 if 


x > 3 








Example 4 












For the following probability distribution of X 




X 





1 2 


3 






p(x) 


• 1 
6 


1 3 

2 10 


1 

30 






Find 


(i)P(X< 


1) (ii)P(X<2) 


(iii) P(0< X < 2) 


Solution : 












(i) P(X< 


D= P(X 
= P(0) 

1 

= 6 + 


= 0) + P(X = 

+ p(l) 

1 4 2 
2-6-3 


1) 







73 



(ii) P(X < 2)= P(X = 0) + P(X = 1) + P(X = 2) 

_ I I JL _ 29 
- 6 + 2 + 10 - 30 

AliterP(X < 2 ) can also be obtained as 

P(X < 2 ) = 1 - P(X > 2) 

1 29 
= 1-P(X = 3) =1-30=30 

(iii) P(0 < X < 2) = P(X = 1) =^ 

8.1.4 Continuous Random Variable 

A random variable X is said to be continuous if it takes a 
continuum of values, i.e. if it takes all possible values between certain 
defined limits. 

For example, 

(i) The amount of rainfall on a rainy day. 

(ii) The height of individuals, (iii) The weight of individuals. 

8.1.5 Probability function 

A function / is said to be the probability density function 
(p.d.f) of a continuous random variable X if the following conditions 
are satisfied 

(i)/0O > for all x (ii) ]f(x) dx = 1 

Remark : 

(i) The probability that the random variable X lies in the interval 

b 

(a, b) is given by P(a < X < b) = )f(x)dx. 

a a 

(ii) P(X = a)= \f(x)dx = 

a 

(iii) P(a < X <b) = P(a < X < b) = P(a < X < b) = P(a < X < b) 
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8.1.6 Continuous Distribution function 

If X is a continuous random variable with p.d.f. f(x), then 
the function F x (x) = P(X < x) 



\f(t)dt 



is called the distribution function (d.f.) or cumulative distribution 
function (c.d.f) of the random variable X. 

Properties : The cumulative distribution function has the 
following properties. 

(i) x Jr^F(*) = i.e.F(-oo) = 

(ii) U o ¥{x) = \ i.e.F(oo) = l 

(iii) Let F be the c.d.f. of a continuous random variable X with 

p.d.f /. Then/(jc) = — F(x) for all x at which F is differentiable. 
dx 

Example 5 

A continuous random variable X has the following p.d.f. 
\k{2-x) for0<x<2 
^ = J otherwise 

Determine the value of k. 
Solution : 

If f(x) be the p.d.f., then \f(x) dx =1 

2 ~°° 

)f(x)dx + ]f(x)dx + )f(x)dx = 1 

2 2 

=> 0+ \f(x)dx + = 1 



2 



j k(2 — x) dx =1 



o 
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(1 



2dx - xdx 
1 



k = ± 



Hence/(x) 



(2-x) for0<;t<2 







otherwise 



Example 6 

Verify that 

\3x 2 forO<x<l 

[0 otherwise 

is a p.d.f and evaluate the following probabilities 



M = 



(i) P(X<|) 



(ii)P(I<X<I) 



Solution 



Clearly /(x) > for all jc and hence one of the conditions for 
p.d.f is satisfied. 

i i 

jf(x)dx = ]f(x)dx = j3x 2 dx = 1 



The other condition for p.d.f is also satisfied. 
Hence the given function is a p.d.f 



(i) P(X<I) 



3 

= jf(x)dx P(X<x)= \f(t)dt 
x 



(») P(i<X<I)= }/(*)<& 



■F 



dbc = i - -^= 



1 _ 19 



8 27 216 
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Example 7 

Given the p.d.f of a continuous random variable X as 
follows 

\kx{\-x) for0<jc<l 

^ = { otherwise 

Find k and c.d.f 

Solution : 

If X is a continuous random variable with p.d.f f(x) then 

I f(x) dx =1 



I k x ( 1-x) dx = 1 








i 


k 


' x 1 x 3 ' 
_2 3_ 


= 1 .-. * = 




Hence the given p 


.d.f becomes, 


[6x( 

m = { 


l-x) forO<;t<l 
otherwise 


To find c.d.f F(x) 




F 


(x)=0 


for x < 



F(x)=P(X<x)= \f(x)dx 

X 

= ^6x(l-x)dx =3x 2 -2x i for0<jc<l 



F(jc)= 1 forx>l 

.-. The c.d.f of X is as follows. 

F(jc) = for*<0 

= 3x 2 -2x 3 forO<x<l. 

= 1 f or x > 1 
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Example 8 

Suppose that the life in hours of a certain part of radio 
tube is a continuous random variable X with p.d.f is given by 

100 



m = 



2 ' 

X 



when x > 100 



elsewhere 



(i) What is the probability that all of three such tubes in a 
given radio set will have to be replaced during the first 
of 150 hours of operation? 

(ii) What is the probability that none of three of the original 
tubes will have to be replaced during that first 150 hours 
of operation? 

Solution : 

(i) A tube in a radio set will have to be replaced during the first 150 
hours if its life is < 150 hours. Hence, the required probability 
'p' that a tube is replaced during the first 150 hours is, 

150 

p = P(X<150) = \f(x)dx 

100 

150 

100 

The probability that all three of the original tubes will have to 

"3 dt 1 



replaced during the first 150 hours =p = hr =97" 

(ii) The probability that a tube is not replaced during the first 150 
hours of operation is given by 

P(X > 150) = 1 - P(X < 150) = 1 - I = I 

the probability that none of the three tubes will be replaced 

(2^ 8 
during the 150 hours of operation = T = "^7 
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EXERCISE 8.1 



1) 



2) 



3) 



4) 



5) 



Which of the following set of functions define a probability 
space onS = [x., x~, xj? 



1' /v 2' ^3 J 

(i) p{x x ) = j p(x 2 ) = 2 
(ii) /?(x x ) = j p(x 2 ) = g 

(iii) /7(Xj) = p(x 2 ) = j- 

2 

(iv) />(*,) =/?(*,) = T 



P(* 3 ) = 4 
p(* 3 ) = 2 
P(x 3 ) = "J 



Consider the experiment of throwing a single die. The random 
variable X represents the score on the upper face and assumes 
the values as follows: 

X : 1 2 3 4 5 6 

*,> = f f f f f f^.^ 

A random variable X has the following probability distribution. 

Values of X, x : 1 2345 678 

p(x) : a 3a 5a 7a 9a 11a 13a 15a 17a 
(i) Determine the value of a 
(ii) Find P(X < 3), P(X > 3) and P(0< X < 5) 
The following function is a probability mass function - Verify. 

4- for x = l 
p(x) = •{-§ for x = 2 

otherwise Hence find the c.d.f 

Find k if the following function is a probability mass function. 

' k 



pix) = 



for x = 

for x = 2 

for x = 4 
otherwise 
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6) A random variable X has the following probability 
distribution 

values of X, x : —2 5 

, ,. 1 11 

P(x). 442 

Evaluate the following probabilities 

(a) P(X < 0) (b) P(X < 0) (c) P((k X < 10) 

7) A random variable X has the following probability function 

Values of X, x : 1 2 3 

_L 1 Jl 

P(x) : 16 8 k 16 

(i) Find the value of k (ii) Construct the c.d.f. of X 

8) A continuous random variable has the following p.d.f 
f{x)=kx 2 , 0<x<10 

= otherwise. 
Determine k and evaluate (i) P(.2 < X < 0.5) (ii) P(X < 3) 

9) If the function f(x) is defined by 

f(x) - ce~ x , < x < °°. Find the value of c. 

10) Let X be a continuous random variable with p.d.f. 
ax, < x < 1 
a, l<x<2 

■ ax + 3a, 2 < x< 3 
otherwise 



f(x) = 



(i) Determine the constant a 
(ii) Compute P(X < 1.5) 

11) Let X be the life length of a certain type of light bulbs in 
hours. Determine 'a' so that the function 

f(x)=-^, 1000 <x< 2000 

= otherwise, 
may be the probability density function. 
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12) The kms. X in thousands which car owners get with a certain 
kind of tyre is a random variable having p.d.f. 

f(x)=^e~™, forx>0 

= for x < 

Find the probabilities that one of these tyres will last 
(i) atmost 10,000 kms 
(ii) anywhere from 16,000 to 24,000 kms 
(iii) atleast 30,000 kms. 

8.2 MATHEMATICAL EXPECTATION 

The concept of Mathematical expectation plays a vital role in 
statistics. Expected value of a random variable is a weighted average 
of all the possible outcomes of an experiment. 

If X is a discrete random value which can assume values 
x v x 2 , ... x n with respective probabilities/?^.) = P[X =x]; i= 1,2 
. . .n then its mathematical expectation is defined as 

E(X) = £ x.p(x.), (Here £ p(x) = 1) 

If X is a continuous random variable with probability density 
function f(x), then 

E(X) = \x f(x) dx 

Note 

E(X) is also known as the mean of the random variable X. 

Properties 

1) E(c) = c where c is constant 

2) E(X + Y) = E(X) + E(Y) 

3) E(aX + b) = <aE(X) + b where a and b are constants. 

4) E(XY) = E(X) E(Y) if X and Y are independent 
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Note 

The above properties holds good for both discrete and 
continous random variables. 

Variance 



by Var (X) orO 2 is 



Let X be a random variable. Then the Variance of X, denoted 
r(X) oralis 

Var(X) = o 2 x = E[X - E(X)] 2 

= E(X 2 ) - [E(X)f 

The positive square root of Var(X) is called the Standard 
Deviation of X and is denoted by c x . 

Example 9 

A multinational bank is concerned about the waiting time 
(in minutes) of its customer before they would use ATM for 
their transaction. A study of a random sample of 500 
customers reveals the following probability distribution. 

X : 012345678 

p(x) : .20 .18 .16 .12 .10 .09 .08 .04 .03 

Calculate the expected value of waiting time, X, of the 
customer 

Solution : 

Let X denote the waiting time (in minutes) per customer. 
X :012345678 
p(x) : .20 .18 .16 .12 .10 .09 .08 .04 .03 

Then E(X)= Zx pix) 

= (0 x .2) + (1 x 0.18) + ... + (8 x 0.03) = 2.71 

The expected value of X is equal to 2.71 minutes. Thus the 
average waiting time of a customer before getting access to ATM is 
2.71 minutes. 
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Example 10 

Find the expected value of the number of heads 
appearing when two fair coins are tossed. 

Solvtion : 

Let X be the random variable denoting the number of heads. 
Possible values of X : 1 2 
Probabilities p(x) : T 9" T 
The Expected value of X is 
E(X) = x l p(x l )+ x 2 p(x 2 ) + x 3 p(x 3 ) 
1} Jl} J\ 



= 01 jj + 1(^1 + 2^4 1 = 1 

Therefore, the expected number of heads appearing in the 
experiment of tossing 2 fair coins is 1 . 

Example 11 

The probability that a man fishing at a particular place 
will catch 1, 2, 3, 4 fish are 0.4, 0.3, 0.2 and 0.1 respectively. 
What is the expected number of fish caught? 

Solution : 



ssible 


values of X : 


1 


2 3 


4 




)babil 


ities p(x.) : 


0.4 


0.3 0.2 


0.1 




E(X) 


= T.x t p(x) 

i 












= x x p{x^ +x 2 


p(x 2 ) +x 2 p(x 3 ) ■ 


±x 4 p(x 


4) 




= 1 (-4) +2(.3) 


+ 3(.2) 


+ 4(.l) 







= .4 + .6 + .6 + .4 = 2 

Example 12 

A person receives a sum of rupees equal to the square 
of the number that appears on the face when a balance die is 
tossed. How much money can he expect to receive? 
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Solution : 

Random variable X: as square of the number that can appear 
on the face of a die. Thus 

possible values of X : l 2 2 2 3 2 4 2 5 2 6 2 
probabilities p(x) ■ \ \ \ \ \ \ 
The Expected amount that he receives, 

E(X)=l^) + 2^) + ... + 6^ 

6 

Example 13 

A player tosses two fair coins. He wins Rs.5 if two heads 
appear, Rs. 2 if 1 head appears and Rs.l if no head occurs. 
Find his expected amount of gain. 

Solution : 

Consider the experiment of tossing two fair coins. There are 
four sample points in the sample space of this experiment. 

i.e. S = {HH, HT, TH, TT} 

Let X be the random variable denoting the amount that a 
player wins associated with the sample point. 

Thus, 



Possible values of X (Rs.) : 5 


2 


1 


Probabilities p(x) : -r 


1 

2 


1 

4 


E(X)=5^) + 2^) + l(}) 






= I i I = 10 = 5 

4 + + 4 4 2 






= Rs. 2.50 






Hence expected amount of winning is Rs.2.50 
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Example 14 












A random variable X has the probability 


function 


as 


follows : 












values of X : 


-1 


1 








probability : 


0.2 


0.3 0.5 








Evaluate (i) E(3X 


+1) 


(ii) E(X 2 ) 


(iii) Var(X) 






Solution : 












X : -1 





1 








p(x) : 0.2 


0.3 


0.5 









(i) E(3X+1)= 3E(X) + 1 

Now E(X) = -1 x 0.2 + x 0.3 + 1 x 0.5 

= -1 x0.2 + + 0.5 = 0.3 
E(3X+1) = 3(0.3) +1 = 1.9 

(ii) E(X 2 )=Zx 2 p(x) 

= (-1) 2 x 0.2 + (0) 2 x 0.3 + (l) 2 x 0.5 

= 0.2 + + 0.5 = 0.7 
(iii) Var(X) = EQP) - [E(X)] 2 

= .7 - (.3) 2 = .61 
Example 15 

Find the mean, variance and the standard deviation for 
the following probability distribution 

Values of X, x : 1 2 3 4 

probability, p(x) : 0.1 0.3 0.4 0.2 

Solution : 

Mean =E(X)=Lxp(x) 

= 1(0.1) + 2(0.3) + 3(0.4) + 4(0.2) = 2.7 
Variance = E(X 2 ) - [E(X)f 

Now E(X 2 ) =Hx 2 p(x) 

= 1 2 (0.1) +2 2 (0.3) +3 2 (0.4) + 4 2 (0.2) = 8.1 
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.-. Variance = 8.1 - (2.7) 2 

= 8. 1-7. 29 = .81 
Standard Deviation = V0.81 =0.9 

Example 16 

Let X be a continuous random variable with p.d.f. 

for - 1 < x < 1 
otherwise 

Find (i) E(X) (ii) ECX 2 ) (iii) Var(X) 
Solution : 



f(x) = 



1 

2 



(i) 



© 



E(X) = \xf(x) dx 

- I )*o* = \ 

l 

E(X 2 )= \* 1 f(x)dx 



(by definition) 



= 



1 

2 



I 

3 



(iii) Var(X) = EQ^) - [E(X)] 2 

=i-o=i 



EXERCISE 8.2 

1) A balanced die is rolled. A person recieves Rs. 10 when the 
number 1 or 3 or 5 occurs and loses Rs. 5 when 2 or 4 or 6 
occurs. How much money can he expect on the average per 
roll in the long run? 

2) Two unbiased dice are thrown. Find the expected value of the 
sum of the points thrown. 
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3) A player tossed two coins. If two heads show he wins Rs. 4. 
If one head shows he wins Rs. 2, but if two tails show he must 
pay Rs. 3 as penalty. Calculate the expected value of the sum 
won by him. 

4) The following represents the probability distribution of D, the 
daily demand of a certain product. Evaluate E(D). 

D : 1 2 3 4 5 

P[D=d] : 0.1 0.1 0.3 0.3 0.2 

5) Find E(2X-7) and E(4X + 5) for the following probability 
distribution. 

X : -3 -2 -1 1 2 3 

p(x) : .05 .1 .3 .3 .15 .1 

6) Find the mean, variance and standard deviation of the following 
probability distribution. 

Values of X : -3 -2 -1 1 2 3 

D K LTl r \ 1111111 

Probability p(x) : - - - - - - - 

7) Find the mean and variance for the following probability 
distribution. 

\le- 2 \ x>0 
[0 , x<0 

8.3 DISCRETE DISTRIBUTIONS 

We know that the frequency distributions are based on 
observed data derived from the collected sample information. For 
example, we may study the marks of the students of a class and 
formulate a frequency distribution as follows: 



Marks 


No. of students 


0-20 


10 


20-40 


12 


40-60 


25 


60-80 


15 


80-100 


18 


Total 


80 
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The above example clearly shows that the observed frequency 
distributions are obtained by grouping. Measures like averages, 
dispersion, correlation, etc. generally provide us a consolidated view 
of the whole observed data. This may very well be used in 
formulating certain ideas (inference) about the characteristics of the 
whole set of data. 

Another type of distribution in which variables are distributed 
according to some definite probability law which can be expressed 
mathematically are called theoretical probability distribution. 

The probability distribution is a total listing of the various values 
the random variable can take along with the corresponding 
probabilities of each value. For example; consider the pattern of 
distribution of machine breakdown in a manufacturing unit. The 
random variable would be the various values the machine breakdown 
could assume. The probability corresponding to each value of the 
breakdown as the relative frequency of occurence of the breakdown. 
This probability distribution is constructed by the actual breakdown 
pattern discussed over a period of time. 

Theoretical probability distributions are basically of two types 

(i) Discrete and (ii) Continuous 

In this section, we will discuss theoretical discrete distributions 
namely, Binomial and Poisson distributions. 

8.3.1 Binomial Distribution 

It is a distribution associated with repetition of independent 
trials of an experiment. Each trial has two possible outcomes, 
generally called success and failure. Such a trial is known as 
Bernoulli trial. 

Some examples of Bernoulli trials are : 

(i) a toss of a coin (Head or tail) 

(ii) the throw of a die (even or odd number) 
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An experiment consisiting of a repeated number of Bernoulli 
trials is called a binomial experiment. A binomial experiment must 
possess the following properties: 

(i) there must be a fixed number of trials. 

(ii) all trials must have identical probabilities of success (p) i.e. if 
we call one of the two outcomes as "success" and the other 
as "failure", then the probability p of success remains as 
constant throughout the experiment. 

(iii) the trials must be independent of each other i.e. the result of 
any trial must not be affected by any of the preceeding trial. 

Let X denote the number of successes in ' rt trials of a binomial 
experiment. Then X follows a binomial distribution with parameters 
n and p and is denoted by X~B(n, p). 

A random variable X is said to follow Binomial distribution 

with parameters n and/?, if it assumes only non-negative values and 
its probability mass function is given by 

P[X=x] =p(x) = n C x p x q nx ; x=0, l,2,..n;q= 1- p 
Remark 

(i) ip(x)= i n C x p x q nx =(q+pT=l 



/fl »p = n(n-l)...(n-r-l) 
V r 1.2.3...r 

Mean and Variance 

For the binomial distribution 

Mean = np 

Variance = npq; Standard Deviation = -Jnpq 

Example 17 

What is the probability of getting exactly 3 heads in 8 
tosses of a fair coin. 
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Solution : 

Let p denote the probability of getting head in a toss. 
Let X be the number of heads in 8 tosses. 

Then p = — , q = — and n = 8 



Probability of getting exactly 3 heads is 

'*1 (* 



P(X = 3) = 8 c 3 



8 x 7 x 6 ( it 7 



1x2x3 12 j 32 

Example 18 

Write down the Binomial distribution whose mean is 20 
and variance being 4. 

Solution : 

Given mean, np = 20 ; variance, npg = 4 

Now q = -n = £- = 1 .-. p = 1 - ? = 1 

^ np 20 5 F y 5 

From np = 20, we have 

20 20 

n = — = — = 25 
y 5 

Hence the binomial distribution is 

p(x)= n Cyq nx =25 C f|J f^J X , ^=0, 1,2, ...,25 

Example 19 

On an average if one vessel in every ten is wrecked, 
find the probability that out of five vessels expected to arrive, 
atleast four will arrive safely. 

Solution : 

Let the probability that a vessel will arrive safely, p = — 
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Then probability that a vessel will be wrecked, q = \-p = — 

No. of vessels, n = 5 

.-. The probability that atleast 4 out of 5 vessels to arrive 
safely is 

P(X > 4) = P(X = 4) + P(X = 5) 

4 ^ioJ 10 \io 

= 5(.9) 4 (.l) + (.9) 5 =.91854 

Example 20 

For a binomial distribution with parameters n = 5 and 
p = .3 find the probabilities of getting (i) atleast 3 successes 
(ii) atmost 3 successes. 

Solution : 

Given n = 5, p = .3 .-. q = .7 
(i) The probability of atleast 3 successes 

P(X>3) = P(X =3) + P(X = 4) + P(X = 5) 

= 5 C 3 (0.3) 3 (0.7) 2 + 5 C 4 (0.3) 4 (0.7) + 5 C 5 (.3) 5 (7)° 
= .1631 
(ii) The probability of atmost 3 successes 

P(X < 3)= P(X =0) + P(X = 1) + P(X = 2) + P(X = 3) 

= (.7) 5 + 5 C t (.7) 4 (.3) + 5 C 2 (.7) 3 (.3) 2 + 5 C 3 {.if (.3) 3 

= .9692 

8.3.2 Poisson distribution 

Poisson distribution is also a discrete probability distribution 
and is widely used in statistics. Poisson distribuition occurs when 
there are events which do not occur as outcomes of a definite number 
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of trials of an experiment but which occur at random points of time 
and space wherein our interest lies only in the number of occurences 
of the event, not in its non-occurances. This distribution is used to 
describe the behaviour of rare events such as 

(i) number of accidents on road 

(ii) number of printing mistakes in a book 

(iii) number of suicides reported in a particular city. 

Poisson distribution is an approximation of binomial 
distribution when n (number of trials) is large and p, the probability 
of success is very close to zero with np as constant. 

A random variable X is said to follow a Poisson distribution 

with parameter X > if it assumes only non-negative values and 
its probability mass function is given by 

P[X = x] =p(x) = ^- ; x = 0, 1, 2, ... 

Jit 

Remark 

It should be noted that 

ZP[X = x]= £ p(x) = 1 

x=0 x=0 

Mean and Variance 

For the of poisson distribution 

Mean, E(X) = X , Variance, Var(X) = X, S.D = VI 

Note 

For poission distribution mean and variance are equal. 

Example 21 

Find the probability that atmost 5 defective fuses will 
be found in a box of 200 fuses if experience shows that 2 
percent of such fuses are defective, (e^ 1 = 0.0183) 
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Solution : 

2 
p = probability that a fuse is defective = — — - 

n = 200 

••• l = np= T ^x200 = 4 

Let X denote the number of defective fuses found in a box. 
Then the distribution is given by 

P[X=x]=p(x) = ^f 

So, probability that atmost 5 defective fuses will be found in 
a box of 200 fuses 

= P(X<5) 

= P(X = 0) + P(X = 1) + P(X = 2) 

+ P(X = 3) + P(X =4) + P(X = 5) 

= e - 4+ iLl4 + e^4l + £ ^4l + iLV + ^ 4 4 5 



1! 2! 3! 4! 5! 

Q ^ n ._ 4 ,_ 4' 4' 4' 4 5 x 

=e (1 + i! + ir + ¥ + ^r + ^! ) 

=0.0183 x ^ =0.785 

Example 22 

Suppose on an average 1 house in 1000 in a certain 
district has a fire during a year. If there are 2000 houses in 
that district, what is the probability that exactly 5 houses will 
have fire during the year? (e~ 2 = .13534) 

Solution : 

p = probability that a house catches fire = 
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Here n = 2000 .-. l = np = 2000 x -J— = 2 
Let X denote the number of houses that has a fire 

— 2 /} x 

Then the distribution is given by P[X =x] = : — , x = 0, 1,2,... 

Probability that exactly 5 houses will have a fire during 
the year is 

e- 2 2 5 



P[X = 5] 

.13534x32 
120 



5! 
.0361 



Example 23 

The number of accidents in a year attributed to taxi 
drivers in a city follows poisson distribution with mean 3. Out 
of 1000 taxi drivers, find the approximate number of drivers 
with 

(i) no accident in a year 

(ii) more than 3 accidents in a year 

Solution : 

Here X = np = 3 

N = 1000 
Then the distribution is 

P[X =x] = , where X denotes the number accidents. 
xl 

(i) P (no accidents in a year) = P(X = 0) 

= e" 3 = 0.05 
.-. Number of drivers with no accident = 1000 x 0.05 = 50 

(ii) P (that more than 3 accident in a year ) = P(X > 3) 
= 1-P(X<3) 

-3/-.1 -3 2 -3^3' 

- I - + it + 2! 3! 

94 



= l-e" 3 [l + 3 + 4.5 + 4.5] 
= l-e" 3 (13) = 1 - .65 = .35 
.-. Number of drivers with more than 3 accidents 
= 1000x0.35 =350 

EXERCISE 8.3 

1) Ten coins are thrown simultaneously. Find the probability of 
getting atleast 7 heads. 

2) In a binomial distribution consisting of 5 independent trials, 
probabilities of 1 and 2 successes are 0.4096 and 0.2048 
respectively. Find the parameter '/?' of the distribution. 

3) For a binomial distribution, the mean is 6 and the standard 
deviation is -J2 . Write down all the terms of the distribution. 

4) The average percentage of failure in a certain examination is 
40. What is the probability that out of a group of 6 candidates 
atleast 4 passed in the examination? 

5) An unbiased coin is tossed six times. What is the probability of 
obtaining four or more heads? 

6) It is stated that 2% of razor blades supplied by a manufacturer 
are defective. A random sample of 200 blades is drawn from 
a lot. Find the probability that 3 or more blades are defective. 
(e" 4 = .01 832) 

7) Find the probability that atmost 5 defective bolts will be found 
in a box of 200 bolts, if it is known that 2% of such bolts are 
expected to be defective (e 4 - 0.01832) 

8) An insurance company insures 4,000 people against loss of 
both eyes in car acidents. Based on previous data, the rates 
were computed on the assumption that on the average 10 
persons in 1 ,00,000 will have car accidents each year that result 
in this type of injury. What is the probability that more 
than 3 of the injured will collect on their policy in a given 
year? (e^ A = 0.6703) 
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9) It is given that 3% of the electric bulbs manufactured by a 
company are defective. Find the probability that a sample of 
100 bulbs will contain (i) no defective (ii) exactly one defective. 
(e~ 3 = 0.0498) . 

10) Suppose the probability that an item produced by particular 
machine is defective equals 0.2. If 10 items produced from 
this machine are selected at random, what is the probability 
that not more than one defective is found? (e~ 2 = . 13534) 

8.4 CONTINUOUS DISTRIBUTIONS 

The binomial and Poisson distributions discussed in the 
previous section are the most useful theoretical distributions. In 
order to have mathematical distribution suitable for dealing with 
quantities whose magnitudes vary continuously like heights and 
weights of individuals, a continuous distribution is needed. Normal 
distribution is one of the most widely used continuous distributions. 

8.4.1 Normal Distribution 

Normal Distribution is considered to be the most important 
and powerful of all the distributions in statistics. It was first 
introduced by De Moivre in 1733 in the development of probability. 
Laplace (1749 - 1827) and Gauss (1827 - 1855) were also 
associated with the development of Normal distribution. 

A random variable X is said to follow a Normal Distribution 

with mean (I and variance o 2 denoted by X ~ N(ji, a 2 ), if its 
probability density function is given by 

2 
U-M-) 

I 2 

f(x) = — 1= e 2o , -oo<^<oo ) -oo<(j,<oo ) a>0 
GV271 

Remark 

The parameters (I and a 2 completely describe the normal 
distribution. Normal distribution could be also considered as hmiting 
form of binomial distribution under the following conditions: 
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(i) n, the number of trials is indefinitely large i.e. n —> °° 
(ii) neither p nor q is very small. 

The graph of the p.d.f of the normal distribution is called the 
Normal curve, and it is given below. 




X = \i 

Normal probability curve 



8.4.2 Properties of Normal Distribution 

The following are some of the important properties of the 
normal curve and the normal distribution. 

(i) The curve is "bell - shaped" and symmetric about x = (I 

(ii) Mean, Median and Mode of the distribution coincide. 

(iii) There is one maximum point of the normal curve which occurs 
at the mean (|i). The height of the curve declines as we go in 
either direction from the mean. 

(iv) The two tails of the curve extend infinitely and never touch 
the horizontal (x) axis. 

(v) Since there is only one maximum point, the normal curve is 
unimodal i.e. it has only one mode. 

(vi) Since f(x) being the probability, it can never be negative and 

hence no portion of the curve lies below the x - axis. 
(vii) The points of inflection are given by x = \l±o 
(viii) Mean Deviation about mean 

71 5 
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(ix) Its mathematical equation is completely determined if the mean 
and S.D are known i.e. for a given mean \x and S.D o, there 
is only one Normal distribution. 

(x) Area Property : For a normal distribution with mean \x and 
S.D o, the total area under normal curve is 1, and 

(a) P(n- o<X<(i + o) = 0.6826 
i.e. (mean) + 1 o covers 68.27%; 

(b) P((i-2o<X<(i + 2o) = 0.9544 

i.e. (mean) + 2o covers 95.45% area 

(c) P((i-3a<X<(i + 3o) = 0.9973 

i.e. (mean) + 3o covers 99.73% area 

10.4.3 Standard Normal Distribution 

A random variable which has a normal distribution with a mean 
(1 = and a standard deviation o = 1 is referred to as Standard 
Normal Distribution. 

Remark 

IfX~N(ji,a 2 ), then Z 



(i) 
(M) 



X-(i 



is a standard normal variate 



with E(Z) = and var(Z) = 1 i.e. Z ~ N(0, 1). 

It is to be noted that the standard normal distribution has the 
same shape as the normal distribution but with the special 
properties of (I = and a = 1 . 




z = o 
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A random variable Z is said to have a standard normal 
distribution if its probability density function is given by 

1 „ -f 



<p(z) 



12% 



°° < Z < °° 



Example 24 

What is the probability that Z 



lies between and 1.83 
is greater than 1.54 
is greater than -0.86 
lies between 0.43 and 1.12 
is less than 0.77 



(a) 
(b) 
(c) 
(d) 
(e) 

Solution : 

(a) Z lies between and 1.83 




Z=0 1.83 

P(0 < Z < 1 .83) = 0.4664 (obtained from the tables directly) 
(b) Z is greater than 1.54 i.e. P(Z > 1.54) 




Since the total area to the right of Z = is 0.5 and area between 
Z = and 1.54 (from tables) is 0.4382 

P(Z > 1 .54)= 0.5 - P(0 < Z < 1 .54) 

= 0.5 -.4382 = .0618 

(c) Z is greater than -0.86 i.e. P(Z > -0.86) 



(d) 




Here the area of interest P(Z > -0.86) is represented by the 
two components. 

(i) Area between Z = -0.86 and Z = 0, which is equal to 
0.3051 (from tables) 

(ii) Z>0, which is 0.5 

.-. P(Z>-0.86) = 0.3051 +0.5 = 0.8051 
Z lies between 0.43 and 1.12 




Z=0 .43 1.12 



P(0.43 <Z< 1.12) = P(0<Z< 1.12)- P(0<Z< 0.43) 
= 0.3686-0.1664 (from tables) 
= 0.2022. 
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(e) Z is less than 0.77 




Z=0 .77 
P( Z < 0.77) = 0.5 + P(0 < Z < 0.77) 

= 0.5 + .2794 = .7794 (from tables) 
Example 25 

If X is a normal random variable with mean 100 and 
variance 36 

find (i) P(X > 112) (ii) P(X < 106) (iii) P(94 < X < 106) 

Solution : 

Mean, (1=100; Variance, o 2 = 36 ; S.D, o = 6 
Then the standard normal variate Z is given by 



Z = 



X-(i _ X-100 



(i) When X = 112, then Z = 112 10 ° = 2 
.-. P(X> 112) = P(Z>2) 




P (0 < Z < oo) - P(0 < Z < 2) 
0.5 - 0.4772 = 0.0228 (from tables) 
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(ii) For a given value X = 106, Z = 7 = 1 




P(X< 106) = P(Z< 1) 

= P (-00 < Z < 0) + P(0 < Z < 1) 

= 0.5 + 0.3413 = 0.8413 (from tables) 

r, 94-100 



(iii) When X = 94 



-1 



x = 106 , z=l^loo = +1 




z=o 



P(94 < X < 106) = P(-l < Z < 1) 

= P(-1 <Z<0) + P(0<Z< 1) 
= 2 P(0 < Z < 1) (by symmetry) 

= 2 (0.3413) 
= 0.6826 
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Example 26 

In a sample of 1000 candidates the mean of certain test 
is 45 and S.D 15. Assuming the normality of the distrbution 
find the following: 

(i) How many candidates score between 40 and 60? 

(ii) How many candidates score above 50? 

(iii) How many candidates score below 30? 

Solution : 

Mean = (i = 45 and S.D. = a = 15 
X-u. _ X-45 



Then Z = 



15 



(i) P (40 < X < 60) = P( — A g 45 < Z < 60 1 g 45 ) 



15 
= P(-|-<Z<1) 



15 




= P(- j < Z < 0) + P(0 < Z < 1 ) 

= P(0 < Z < 0.33) + P(0 < Z < 1) 
= 0.1293 + 0.3413 (from tables) 
P(40 < X < 60) = 0.4706 
Hence number of candidates scoring between 40 and 60 
= 1000x0.4706 = 470.6-471 
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(ii) P(X>50)= P(Z>j) 




z=o I 

3 

= 0.5 - P(0 < Z <!-) = 0.5 - P(0 < Z 0.33) 

= 0.5-0.1293 = 0.3707 (from tables) 
Hence number of candidates scoring above 50 

= 1000x0.3707 = 371. 
(iii) P(X<30) =P(Z<-1) 




-l z=o 



= 0.5-P(-l<Z <0) 

= 0.5-P(0<Z< 1) ^Symmetry 

= 0.5-0.3413 = 0.1587 (from tables) 
.-. Number of candidates scoring less than 30 
= 1000x0.1587= 159 

Example 27 

The I.Q (intelligence quotient) of a group of 1000 school 
children has mean 96 and the standard deviation 12. 
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Assuming that the distribution of I.Q among school children 
is normal, find approximately the number of school children 
having I.Q. 

(i) less than 72 (ii) between 80 and 120 

Solution : 

GivenN = 1000, u. = 96 and o = 12 
X-n _ X-96 



Then Z = 
(i) P(X < 72) 



o 12 

P(Z < -2) 




= P(-oo < Z < 0) - P(-2 < Z < 0) 

= P(0 < Z < oo) - p(0 < Z < 2) (By symmetry) 

= 0.5 - 0.4772 (from tables) = 0.0228. 

Number of school children having I.Q less than 72 
=1000x0.0228 =22.8~23 

(ii) P(80<X< 120) = P(-1.33<Z<2) 




= P(-1.33 < Z < 0) + P(0< Z < 2) 
= P(0<Z< 1.33) + P(0<Z<2) 
= .4082 + .4772 (from tables) 
= 0.8854 
.-. Number of school children having I.Q. between 80 and 120 
= 1000 x. 8854 =885. 

Exercise 28 

In a normal distribution 20% of the items are less than 
100 and 30% are over 200. Find the mean and S.D of the 
distribution. 

Solution : 

Representing the given data diagramtically, 




From the diagram 

P(-Zj<Z<0) =0.3 
i.e. P(0 < Z < Zj) = 0.3 

Z x = 0.84 (from the normal table) 

100 -u 
Hence -0.84 = — ^ — 

i.e. 100- (I = -0.84a (1) 



P(0<Z<Z 2 ) =0.2 
Z 2 = 0.525 (from the normal table) 

106 



200 -LL 

Hence 0.525 = — ^-^ 

i.e. 200 -|i = 0.525c (2) 

Solving (1) and (2), ^ = 161.53 
a =73.26 

EXERCISE 8.4 

1) Find the area under the standard normal curve which lies 
(i) to the right of Z = 2.70 

(ii) to the left of Z = 1.73 

2) Find the area under the standard normal curve which lies 
(i) between Z = 1.25 and Z = 1.67 

(ii) between Z = -0.90 and Z = -1.85 

3) The distribution of marks obtained by a group of students may 
be assumed to be normal with mean 50 marks and standard 
deviation 15 marks. Estimate the proportion of students with 
marks below 35. 

4) The marks in Economics obtained by the students in Public 
examination is assumed to be approximately normally distributed 
with mean 45 and S.D 3. A student taking this subject is chosen 
at random. What is the probability that his mark is above 70? 

5) Assuming the mean height of soldiers to be 68.22 inches with 
a variance 10.8 inches. How many soldiers in a regiment of 
1000 would you expect to be over 6 feet tall ? 

6) The mean yield for one-acre plot is 663 kgs with a S.D 32 
kgs. Assuming normal distribution, how many one-acre plot 
in a batch of 1000 plots would you expect to have yield (i) 
over 700 kgs (ii) below 650 kgs. 

7) A large number of measurements is normally distributed with 
a mean of 65.5" and S.D of 6.2". Find the percentage of 
measurements that fall between 54.8" and 68.8". 
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8) The diameter of shafts produced in a factory conforms to 
normal distribution. 31% of the shafts have a diameter less 
than 45mm. and 8% have more than 64mm. Find the mean 
and standard deviation of the diameter of shafts. 

9) The results of a particular examination are given below in a 
summary form. 



Result 


percentage of candidates 


1 . passed with distinction 

2. passed 

3. failed 


10 
60 
30 



It is known that a candidate gets plucked if he obtained less 
tham 40 marks out of 100 while he must obtain atleast 75 
marks in order to pass with distinction. Determine the mean 
and the standard deviation of the distribution assuming this 
to be normal. 



EXERCISE 8.5 
Choose the correct answer 



1) 



If a fair coin is tossed three times the probability function p(x) 
of the number of heads x is 



(a) 


X 





1 


2 


3 




p(x) 


1 

8 


1 

8 


2 
8 


3 
8 


(c) 


X 





1 


2 


3 




p(x) 


1 

8 


1 

8 


2 
8 


3 
8 



(b) 



(d) none of these 



X 





1 


2 


3 


p(x) 


1 

8 


3 
8 


3 
8 


1 

8 



2) If a discrete random variable has the probability mass function as 



then the value of k is 



X 





1 


2 


3 


p(x) 


k 


2k 


3k 


5k 



(a) 



11 



(b) 



11 



(c) 



11 



(d) 



11 
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3) If the probability density function of a variable X is defined as 
f(x) = Cc (2 -x), < x < 2 then the value of C is 

(a) | (b) | (c) | (d) j- 

4) The mean and variance of a binomial distribution are 
(a) np, npq (b) pq, npq (c)np, ^jnpq (d)np,nq 

5) If X~N (|a, a), the standard Normal variate is distributed as 
(a) N(0, 0) (b) N(l, 0) (c) N(0, 1) (d) N(l, 1) 

6) The normal distribution curve is 

(a) Bimodal (b) Unimodal 

(c) Skewed (d) none of these 

7) If X is a poission variate with P(X = 1) = P(X = 2), the mean 
of the Poisson variate is equal to 

(a)l (b)2 (c)-2 (d)3 

8) The standard deviation of a Poissson variate is 2, the mean of 
the poisson variate is 

(a) 2 (b) 4 (c) 4l (d) -jf 

9) The random variables X and Y are independent if 

(a) E(X Y) = 1 (b) E(XY) = 

(c) E(X Y) = E(X) E(Y) (d) E(X+Y) = E(X) + E(Y) 

10) The mean and variance of a binomial distribution are 8 and 4 
respectively. Then P(X = 1) is equal to 

(a) ^n (b) ^r (c) ^ (d) ^To 

1 1) If X~N (\i, a 2 ), the points of inflection of normal distribution 
curve are 

(a) ±\i (b) \i ± a (c) a + \i (d) \i ± 2a 

12) If X~N (\i, a 2 ), the maximum probability at the point of 
inflection of normal distribution is 

(a) 



1 c * 


(b)" 


■42% 


1 


1 


~j2n 


w a-s/2^ 


W Jin 
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13) If a random variable X has the following probability distribution 

X -1-2 12 

p(x) ;r- — — — then the expected value of X is 

(a) | (b) \ (c) \ (d) \ 

14) If X~N (5, 1), the probability density function for the normal 
variate X is 

i -W) 2 i -W' )2 

1 -i(*-5) 2 1 -l(*-5) 2 

< c > IT/ W ^T 

15) If X~N (8, 64), the standard normal variate Z will be 

, v X-64 ., s X-8 

(a)z = -g- (b)-^- 

^ X-8 ,a\^z* 

< c )-g- (d) V8 
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SAMPLING TECHNIQUES 
AND STATISTICAL INFERENCE 



9 



9.1 SAMPLING AND TYPES OF ERRORS 

Sampling is being used in our everyday life without knowing 
about it. For examples, a cook tests a small quantity of rice to see 
whether it has been well cooked and a grain merchant does not 
examine each grain of what he intends to purchase, but inspects 
only a small quantity of grains. Most of our decisions are based on 
the examination of a few items only. 

In a statistical investigation, the interest usually lies in the 
assessment of general magnitude and the study of variation with 
respect to one or more characteristics relating to individuals 
belonging to a group. This group of individuals or units under study 
is called population or universe. Thus in statistics, population is 
an aggregate of objects or units under study. The population may 
be finite or infinite. 

9.1.1 Sampling and sample 

Sampling is a method of selecting units for analysis such as 
households, consumers, companies etc. from the respective 
population under statistical investigation. The theory of sampling is 
based on the principle of statistical regularity. According to 
this principle, a moderately large number of items chosen at random 
from a large group are almost sure on an average to possess the 
characteristics of the larger group. 

A smallest non-divisible part of the population is called a unit. 
A unit should be well defined and should not be ambiguous. For 
example, if we define unit as a household, then it should be defined 
that a person should not belong to two households nor should it 
leave out persons belonging to the population. 

A finite subset of a population is called a sample and the 
number of units in a sample is called its sample size. 
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By analysing the data collected from the sample one can draw 
inference about the population under study. 

9.1.2 Parameter and Statistic 

The statistical constants of a population like mean (|i), variance 
(a 2 ), proportion (P) are termed as parameters. Statistical 
measures like mean (X), variance (s 2 ), proportion (p) computed 
from the sampled observations are known as statistics. 

Sampling is employed to throw light on the population 
parameter. A statistic is an estimate based on sample data to draw 
inference about the population parameter. 

9.1.3 Need for Sampling 

Suppose that the raw materials department in a company 
receives items in lots and issues them to the production department 
as and when required. Before accepting these items, the inspection 
department inspects or tests them to make sure that they meet the 
required specifications. Thus 

(i) it could inspect all items in the lot or 

(ii) it could take a sample and inspect the sample for defectives 
and then estimate the total number of defectives for the 
population as a whole. 

The first approach is called complete enumeration (census). 

It has two major disadvantages namely, the time consumed and the 
cost involved in it. 

The second approach that uses sampling has two major 
advantages, (i) It is significantly less expensive, (ii) It takes least 
possible time with best possible results. 

There are situations that involve destruction procedure where 
sampling is the only answer. A well-designed statistical sampling 
methodology would give accurate results and at the same time will 
result in cost reduction and least time. Thus sampling is the best 
available tool to decision makers. 
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9.1.4 Elements of Sampling Plan 

The main steps involved in the planning and execution of sample 
survey are : 

(i) Objectives 

The first task is to lay down in concrete terms the basic 
objectives of the survey. Failure to define the objective(s) will clearly 
undermine the purpose of carrying out the survey itself. For example, 
if a nationalised bank wants to study savings bank account holders 
perception of the service quality rendered over a period of one 
year, the objective of the sampling is, here,to analyse the perception 
of the account holders in the bank. 

(ii) Population to be covered 

Based on the objectives of the survey, the population should 
be well defined. The characteristics concerning the population under 
study should also be clearly defined. For example, to analyse the 
perception of the savings bank account holders about the service 
rendered by the bank, all the account holders in the bank constitute 
the population to be investigated. 

(iii) Sampling frame 

In order to cover the population decided upon, there should 
be some list, map or other acceptable material (called the frame) 
which serves as a guide to the population to be covered. The list or 
map must be examined to be sure that it is reasonably free from 
defects. The sampling frame will help us in the selection of sample. 
All the account numbers of the savings bank account holders in the 
bank are the sampling frame in the analysis of perception of the 
customers regarding the service rendered by the bank. 

(iv) Sampling unit 

For the purpose of sample selection, the population should 
be capable of being divided up into sampling units. The division of 
the population into sampling units should be unambiguous. Every 
element of the population should belong to just one sampling unit. 
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Each account holder of the savings bank account in the bank, form 
a unit of the sample as all the savings bank account holders in the 
bank constitute the population. 

(v) Sample selection 

The size of the sample and the manner of selecting the sample 
should be defined based on the objectives of the statistical 
investigation. The estimation of population parameter along with 
their margin of uncertainity are some of the important aspects to be 
followed in sample selection. 

(vi) Collection of data 

The method of collecting the information has to be decided, 
keeping in view the costs involved and the accuracy aimed at. 
Physical observation, interviewing respondents and collecting data 
through mail are some of the methods that can be followed in 
collection of data. 

(vii) Analysis of data 

The collected data should be properly classified and subjected 
to an appropriate analysis. The conclusions are drawn based on 
the results of the analysis. 

9.1.5 Types of Sampling 



Types of 
Sampling 



Probability Sampling 

or 

Random sampling 



Non-Probability Sampling 

or 

Non-Random Sampling 



Simple 
Random 
Sampling 



Stratified 
Random 
Sampling 



Systematic 
Sampling 



Cluster 
Sampling 



Convenience 
Sampling 



Expert 
Sampling 



Quota 
Sampling 
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The technique of selecting a sample from a population usually 
depends on the nature of the data and the type of enquiry. The 

procedure of sampling may be broadly classified under the following 
heads : 

(i) Probability sampling or random sampling and 

(ii) Non-probability sampling or non-random sampling. 

(i) Probability sampling 

Probability sampling is a method of sampling that ensures that 
every unit in the population has a known non-zero chance of being 
included in the sample. 

The different methods of random sampling are : 

(a) Simple Random Sampling 

Simple random sampling is the foundation of probability 
sampling. It is a special case of probability sampling in which every 
unit in the population has an equal chance of being included in a 
sample. Simple random sampling also makes the selection of every 
possible combination of the desired number of units equally likely. 
Sampling may be done with or without replacement. 

It may be noted that when the sampling is with replacement, 
the units drawn are replaced before the next selection is made.The 
population size remains constant when the sampling is with 
replacement. 

If one wants to select n units from a population of size N 
without replacement,then every possible selection of n units must 
have the same probability. Thus there are N c n possible ways to 
pick up n units from the population of size N. Simple random 
sampling guarantees that a sample of n units, has the same probability 

n of being selected. 

Example 

A bank wants to study the Savings Bank account holders 
perception of the service quality rendered over a period of one 
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year. The bank has to prepare a complete list of savings bank 
account holders, called as sampling frame, say 500. Now the 
process involves selecting a sample of 50 out of 500 and interviewing 
them. This could be achieved in many ways. Two common ways 
are : 

(1) Lottery method : Select 50 slips from a box containing well 
shuffled 500 slips of account numbers without replacement. 
This method can be applied when the population is small 
enough to handle. 

(2) Random numbers method : When the population size is 
very large, the most practical and inexpensive method of 
selecting a simple random sample is by using the random 
number tables. 

(b) Stratified Random Sampling 

Stratified random sampling involves dividing the population 
into a number of groups called strata in such a manner that the 
units within a stratum are homogeneous and the units between the 
strata are hetrogeneous . The next step involves selecting a simple 
random sample of appropriate size from each stratum. The sample 
size in each stratum is usually of (a) equal size, (b) proportionate 
to the number of units in the stratum. 

For example, a marketing manager in a consumer product 
company wants to study the customer's attitude towards a new 
product in order to improve the sales. Then three typical cities that 
will influence the sales will be considered as three strata. The 
customers within a city are similar but between the cities are vastly 
different. Selection of the customers for the study from each city 
has to be a random sample to draw meaningful inference on the 
whole population. 

(c) Systematic sampling 

Systematic sampling is a convenient way of selecting a sample. 
It requires less time and cost when compared to simple random 
sampling. 
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In this method, the units are selected from the population at a 
uniform interval. To facilitate this we arrange the items in numerical, 
alphabetical, geographical or any other order. When a complete 
list of the population is available, this method is used. 

If we want to select a sample of size n from a population of 

size N under systematic sampling, frist select an item j at random 

N 

such that 1 < / < k where k = and k is the nearest possible 

_ m + 1 r 

integer. Then j,j + k, j + 7k , ... ,j + (n-l) k ' items constitute a 

systematic random sample. 

For example, if we want to select a sample of 9 students out 
of 105 students numbered as 1, 2, ..., 105 , select a student among 

1, 2, ... , 1 1 at random (say at 3rd position). Here k = -rrr = 10.5 

and .-. k = 11. Hence students at the positions 3, 14, 25, 36, 47, 
58, 69, 80, 91 form a random sample of size 9. 

(d) Cluster sampling 

Cluster sampling is used when the population is divided into 
groups or clusters such that each cluster is a representative of the 
population. 

If a study has to be done to find out the number of children 
that each family in Chennai has, then the city can be divided into 
several clusters and a few clusters can be chosen at random. Every 
family in the chosen clusters can be a sample unit. 

In using cluster sampling the following points should be noted 

(a) For getting precise results clusters should be as small as 
possible consistent with the cost and limitations of the survey 
and 

(b ) The number of units in each cluster must be more or less equal. 

(ii) Non-Probability Sampling 

The fundamental difference between probability sampling and 
non-probability sampling is that in non-probability sampling 
procedure, the selection of the sample units does not ensure a known 
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chance to the units being selected. In other words the units are 
selected without using the principle of probability. Even though 
the non-probability sampling has advantages such as reduced cost, 
speed and convenience in implementation, it lacks accuracy in view 
of the seleciton bias. Non-probability sampling is suitable for pilot 
studies and exploratory research 

The methods of non-random sampling are : 

(a) Purposive sampling 

In this sampling, the sample is selected with definite purpose 
in view and the choice of the sampling units depends entirely on the 
discretion and judgement of the investigator. 

For example, if an investigator wants to give the picture that 
the standard of living has increased in the city of Madurai, he may 
take the individuals in the sample from the posh localities and ignore 
the localities where low income group and middle class families live. 

(b) Quota sampling 

This is a restricted type of purposive sampling. This consists 
in specifying quotas of the samples to be drawn from different groups 
and then drawing the required samples from these groups by 
purposive sampling. Quota sampling is widely used in opinion and 
market research surveys. 

(c) Expert opinion sampling or expert sampling 

Expert opinion sampling involves gathering a set of people 
who have the knowledge and expertise in certain key areas that are 
crucial to decision making. The advantage of this sampling is that it 
acts as a support mechanism for some of our decisions in situations 
where virtually no data are available. The major disadvantage is 
that even the experts can have prejudices, likes and dislikes that 
might distort the results. 

9.1.6 Sampling and non-sampling errors 

The errors involved in the collection of data, processing and 
analysis of data may be broadly classfied as (i) sampling errors 
and (ii) non-sampling errors. 
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(i) Sampling errors 

Sampling errors have their origin in sampling and arise due to 
the fact that only a part of the population has been used to estimate 
population parameters and draw inference about the population. 
Increasing in the sample size usually results in decrease in the 
sampling error. 

Sampling errors are primarily due to some of the following 
reasons : 

(a) Faulty selection of the sample 

Some of the bias is introduced by the use of defective sampling 
technique for the selection of a sample in which the investigator 
deliberately selects a representative sample to obtain certain results. 

(b) Substitution 

If difficulty arise in enumerating a particular sampling unit 
included in the random sample, the investigators usually substitute a 
convenient member of the population leading to sampling error. 

(c) Faulty demarcation of sampling units 

Bias due to defective demarcation of sampling units is 
particularly significant in area surveys such as agricultural 
experiments. Thus faulty demarcation could cause sampling error. 

(ii) Non-sampling errors 

The non-sampling errors primarily arise at the stages of 
observation, classification and analysis of data. 

Non-sampling errors can occur at every stage of the planning 
or execution of census or sample surveys. Some of the more 
important non-sampling errors arise from the following factors : 

(a) Errors due to faulty planning and definitions 

Sampling error arises due to improper data specification, error 
in location of units, measurement of characteristics and lack 
of trained investigators. 

119 



(b) Response errors 

These errors occur as a result of the responses furnished by 
the respondents. 

(c) Non-response bias 

Non-response biases occur due to incomplete information on 
all the sampling units. 

(d) Errors in coverage 

These errors occur in the coverage of sampling units. 

(e) Compiling errors 

These errors arise due to compilation such as editing and 
coding of responses. 

EXERCISE 9.1 

1) Explain sampling distribution and standard error. 

2) Distinguish between the terms parameter and statistic 

3) Explain briefly the elements of sampling plan. 

4) Discuss probability sampling. 

5) Discuss non-probability sampling. 

6) Distinguish between sampling and non sampling errors. 

9.2 SAMPLING DISTRIBUTIONS 

Consider all possible samples of size n which can be drawn 
from a given population. For each sample we can compute a statistic 
such as mean, standard deviation, etc. which will vary from sample 
to sample. The aggregate of various values of the statistic under 
consideration may be grouped into a frequency distribution. This 
distribution is known as sampling distribution of the statistic. Thus 
the probability distribution of all the possible values that a sample 
statistic can take, is called the sampling distribution of the statistic. 
Sample mean and sample proportion based on a random sample 
are examples of sample statistic. 

Supposing a Market Research Agency wants to estimate the 
annual household expenditure on consumer durables from among 
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the population of households (say 50000 households) in Tamil Nadu. 
The agency can choose fifty different samples of 50 households 
each. For each of the samples, we can calculate the mean annual 
expenditure on consumer durables as given in the following table : 



Sample No. 


Total expenditure for 
50 households 


Mean 
Rs. 


1 

2 
3 
4 

49 
50 


100000 
300000 
200000 
150000 

600000 
400000 


2000 
6000 
4000 
3000 

12000 
8000 



The distribution of all the sample means is known as the 
sampling distribution of the mean. The figures Rs. 2000, 
6000 ... 8000 are the sampling distribution of the means. 

Similarly, the sampling distribution of the sample variance 
and sample proportion can also be obtained. 

In a sample of n items if n x belongs to Category- 1 and 

Tl 

n-n. belongs to the Category-2, then — is defined as the 



n—n 



sample proportion p belonging to the first category and 

n 

or (1-p) is the sample proportion of second category. This concept 
could be extended to k such categories with proportions (say) p v 
p v ... ,p k such that p j + p 2 + ■■■ +P k = 1 

We could also arrive at a sampling distribution of a proportion. 

For example, if in a factory producing electrical switches, 15 different 
samples of 1000 switches are taken for inspection and number of 
defectives in each sample could be noted. We could find a probability 
distribtuion of the proportion of defective switches. 
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9.2.1 Sampling distribution of the Mean from normal 
population 

If X p Xj, ..., X n are n independent random samples drawn 
from a normal population with mean (I and standard deviation o, 
then the sampling distribution of X(the sample mean) follows a 

normal distribution with mean (I and standard deviation —j= . 

It may be noted that 

_ EX. X,+X 2 +...+ X 
(1) the sample mean X = - 



_ n n 

Thus X is a random variable and will be different every time 

when a new sample of n observations are taken 

(ii) X is an unbiased estimator of the population mean (I. 
i.e. E(X ) = (I, denoted by (i_ = (I. 

(iii) the standard deviation of the sample mean X is given by 

x \/n 
For example, consider a sample of weights of four boys from 
the normal population of size 10000 with replacement. The mean 
weight of the four boys is worked out. Again take another new 
sample of four boys from the same population and find the mean 
weight. If the process is repeated an infinite number of times, the 
probability distribution of these infinite number of sample means 
would be sampling distribution of mean. 

9.2.2 Central limit theorem 

When the samples are drawn from a normal population 
with mean (I and standard deviation o, the sampling distribution 

of the mean is also normal with mean u_ = u and standard 

x 

deviation o ^ = — t= . However the sampling distribution of the 
mean, when the population is not normal is equally important. 
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The central limit theorem says that from any given 
population with mean (I and standard deivation o, if we draw a 
random sample of n observations, the sampling distribution of the 
mean will approach a normal distribution with a mean (I and standard 

deviation — j=as the sample size increases and becomes large. 

In practice a sample size of 30 and above is considered to be 
large. 

Thus the central limit theorem is a hall mark of statistical 
inference. It permits us to make inference about the population 
parameter based on random samples drawn from populations that 
are not neccessarily normally distributed. 

9.2.3 Sampling distribution of proportions 

Suppose that a population is infinite and that the probability 
of occurrence of an event, say success, is P. Let Q = 1 - P denote 
the probability of failure. 

Consider all possible samples of size n drawn from this 
population. For each sample, determine the proportion p of 
successes. Applying central limit theorem, if the sample size n is 
large, the distribution of the sample porportion p follows a normal 

distribution with mean ll = P and S.D a = ILS. . 

V n 

9.2.4 Standard error 

The standard deviation of the sampling distribution of a statistic 
is called the standard errorof the statistic. The standard deviation 
of the distribution of the sample means is called the standard error 
of the mean Likewise, the standard deviation of the distribution 
of the sample proportions is called the standard error of the 
proportion. 

The standard error is popularly known as sampling error. 
Sampling error throws light on the precision and accuracy of the 
estimate. The standard error is inversely proportional to the sample 
size i.e. the larger the sample size the samller the standard error. 
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The standard error measures the dispersion of all possible 
values of the statistic in repeated samples of a fixed size from a 
given population. It is used to set up confidence limits for population 
parameters in tests of significance. Thus the standard errors of 
sample mean X and sample proportion p are used to find 
confidence limits for the population mean \x and the population 



Droportion F re 


spectivery. 




Statistic 


Standard error 


Remarks 


Sample mean 
X 


a 
4n 


Population size is 
infinite or sample with 
replacement. 




Population size N finite 
or sample without 
replacement 


o JN-n 


Sample 
proportion p 


Ipq 

1 n 


Population size is 
infinite or sample with 
replacement. 




Population size N finite 
or sample without 
replacement 


/PQ |N-/i 
V n V N-1 



Example 1 

A population consists of the five numbers 2, 3, 6, 8, 11. 
Consider all possible samples of size 2 which can be drawn 
with replacement from this population, find (i) mean of the 
population, (i) the standard deviation of the population (iii) 
the mean of the sample distribution of means and (iv) the 
standard error of means. 



Solution : 

(i) The population mean (I = 



Ix ^ 2+3 + 6 + 8 + 11 

N 



= 6 



(ii) The variance of the population a 
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J_ 

N 



EOt - |l) 2 



= j {(2-6) 2 + (3-6) 2 + (6-6) 2 + (8-6) 2 + (11-6) 2 } 

= 10.8 

.-. the standard deviation of the population o = 3.29 

(iii) There are 25 samples of size two which can be drawn with 
replacement. They are 

(2,2) (2,3) (2,6) (2,8) (2,11) 

(3,2) (3,3) (3,6) (3,8) (3,11) 

(6,2) (6,3) (6,6) (6,8) (6,11) 

(8,2) (8,3) (8,6) (8,8) (8,11) 

(11,2) (11,3) (11,6) (11,8) (11,11) 

The corresponding sample means are 

2.0 2.5 4.0 5.0 6.5 

2.5 3.0 4.5 5.5 7.0 

4.0 4.5 6.0 7.0 8.5 

5.0 5.5 7.0 8.0 9.5 

6.5 7.0 8.5 9.5 11.0 

The mean of sampling distribution of means 

sum of all sample means 150 
^x = 25 = ^5 =6 -° 

(iv) The variance o 2 ^ of the sampling distribution of means is 
obtained as follows : 

o| =^ {2-6) 2 + (2.5 - 6) 2 + ... + (6.5-6) 2 + ... 
+ (9.5-6) 2 +(ll-6) 2 } 

-135-5 4 

~ 25 

.-. the standard error of means a_ = V5.4 = 2.32 

x 
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Example 2 

Assume that the monthly savings of 1000 employees 
working in a factory are normally distributed with mean 
Rs. 2000 and standard deviation Rs. 50 If 25 samples 
consisting of 4 employees each are obtained, what would be 
the mean and standard deviation of the resulting sampling 
distribution of means if sampling were done (i) with 
replacement, (ii) without replacement. 

Solution : 

Given N = 1000, u, = 2000, o = 50, n = 4 

(i) Sampling with replacement 

u _= m = 2000 

x 

a 50 os: 
x ~Jn a/4 
(ii) Sampling without replacement 

The mean of the sampling distribution of the means is 

u _ = m = 2000 

x 

The standard deviation of the sampling distribution of means 
is 



a N-n 



°x"^N-l 



J50 / 100-4 



74 "V 1000-1 

= (25) Jf = 25 (^96) 
= (25) (0.9984) = 24.96 



Example 3 

A random sample of size 5 is drawn without replacement 
from a finite population consisting of 41 units. If the population 
S.D is 6.25, find the S.E of the sample mean. 
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Solution : 

Population size N = 41 

Sample size n = 5 

Standard deviation of the popultion o = 6.25 



S.E of sample mean = — t= I j- (N is finite) 

\/n V N — 1 



6.25 /41-5 

1 v A K 

=2.65 



■S V 41-1 

6.25 x 6 3 x 6.25 



-Js 2V10 5V2 

Example 4 

The marks obtained by students in an aptitude test are 
normally distributed with a mean of 60 and a standard 
deviation of 30. A random sample of 36 students is drawn 
from this population. 

(i) What is the standard error of the sampling mean? 

(ii) What is the probability that the mean of a sample of 
16 students will be either less than 50 or greater than 80? 

[ P (0 < Z < 4) = 0.4999 ] 

Solution : 

(i) The standard error of the sample mean X is given by 

o ? = —j== —j= = 5 ( N is not given) 
x ^ a/36 B 

(ii) The random variable X follows normal distribution with mean 
u v and standard deviation — t=. 

To find P( X <50 or X >80). 

P(X<50 or X>80) = P(X<50) + P(X>80) 

X-1% . 80-60 



(w .. ™ ™A fv .. ™ ™A 



= P 



X-1% < 50-60 



+ P 



<% 



> 
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= P (Z < -2) + P(Z > 4) 
= [0.5 - P(0 < Z < 2)] + [0.5 - P(0 < Z < 4)] 
= (0.5 - 0.4772) + (0.5 - 0.4999) 
= .02283, which is the required probability. 
Example 5 

2% of the screws produced by a machine are defective. 
What is the probability that in a consignment of 400 such 
screws, 3% or more will be defective. 

Solution : 

Here N is not given, but n = 400 

Population proportion P = 2% = 0.02 .-. Q = 1 - P = 0.98 

The sample size is large 

The sample porportion is normally distributed with mean 



,, nm ^ct^ P Q 0-02x0.98 n nn „ 
\i p = 0.02 and S.D =^ = ^ ^ = 0.007 

Probability that the sample proportion p > 0.03 

= Area under the normal curve to the right of Z = 1.43. 

(7 - P~ P 0-03-0.02 _ , .,. 
{ S.D " 0.007 _ lA5) 
.-. required probability = 0.5 - Area between Z = to Z = 1.43 

= 0.5-0.4236 = 0.0764 

EXERCISE 9.2 

1) A population consists of four numbers 3, 7, 1 1 and 15. Consider 
all possible samples of size two which can be drawn with 
replacement from this population. 
Find (i) the population mean 

(ii) the population standard deviation, 
(iii) the mean of the sampling distribution of mean 
(iv) the standard deviation of the sampling distribution of 
mean. 

128 



2) A population consists of four numbers 3, 7, 1 1 and 15. Consider 
all possible samples of size two which can be drawn without 
replacement from this population. 

Find (i) the population mean 

(ii) the population standard deviation. 

(iii) the mean of the sampling distribution of mean 

(iv) the standard deviation of the sampling distribution of 
mean. 

3) The weights of 1500 iron rods are normally distributed with 
mean of 22.4 kgs. and standard deviation of 0.048 kg. If 300 
random samples of size 36 are drawn from this population, 
determine the mean and standard deviation of the sampling 
distribution of mean when sampling is done (i) with replacement 
(ii) without replacement. 

4) 1% of the outgoing +2 students in a school have joined I.I.T. 
Madras. What is the probability that in a group of 500 such 
students 2% or more will be joining I.I.T. Madras. 

9.3 ESTIMATION 

The technique used for generalising the results of the sample 
to the population is provided by an important branch of statistics 
called statistical inference. The concept of statistical inference 
deals with two basic aspects namely (a) Estimation and (b) Testing 
of hypothesis. 

In statistics, estimation is concerned with making inference 
about the parameters of the population using information available 
in the samples. The parameter estimation is very much needed in 
the decision making process. 

The estimation of population parameters such as mean, 
variance, proportion, etc. from the correspoinding sample statistics 
is an important function of statistical inference. 
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9.3.1 Estimator 

A sample statistic which is used to estimate a population 
parameter is known as estimator. 

A good estimator is one which is as close to the true value of 
population parameter as possible. A good estimator possesses the 
following properties: 

(i) Unbiasedness 

As estimate is said to be unbiased if its expected value is 
equal to its parameter. 

The sample mean X = — Zx is an unbiased estimator of 
population mean \i. For a sample of sizen, drawn from a population 

of size N, s 2 = -r E(jc— jc) 2 is an unbiased estimator of 

n-\ 

population variance. Hence s 2 is used in estimation and in testing 
of hypothesis. 

(ii) Consistency 

An estimator is said to be consistent if the estimate tends to 
approach the parameter as the sample size increases. 

(iii) Efficiency 

If we have two unbiased estimators for the same population 
prarameter, the first estimator is said to be more efficient than the 
second estimator if the standard error of the first estimator is smaller 
than that of the second estimator for the same sample size. 

(iv) Sufficiency 

If an estimator possesses all information regarding the 
parameter, then the estimator is said to be a sufficient estimator. 

9.3.2 Point Estimate and Interval Estimate 

It is possible to find two types of estimates for a population 
parameter. They are point estimate and interval estimate. 

Point Estimate 

An estimate of a population parameter given by a single 
number is called a point estimator of the parameter. Mean ( x ) and 
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1 _ 2 

the sample variance [s 2 = r E(x- x ) ] are the examples of 

point estimates. 

A point estimate will rarely coincide with the true population 
parameter value. 

Interval Estimate 

An estimate of a population parameter given by two numbers 
between which the parameter is expected to lie is called an interval 
estimate of the parameter. 

Interval estimate indicates the accuracy of an estimate and is 
therefore preferable to point estimate. As point estimate provides 
a single value for the population parameter it may not be suitable in 
some situation. 

For example, 

if we say that a distance is measured as 5.28mm, we are giving 
a point estimate. On the other hand, if we say that the distance is 
5.28 + 0.03 mm i.e. the distance lies between 5.25 and 5.31mm, 
we are giving an interval estimate. 

9.3.3 Confidence Interval for population mean and proportion 

The interval within which the unknown value of parameter is 
expected to lie is called confidence interval. The limits so 
determined are called confidence limits. 

Confidence intervals indicate the probability that the population 
parameter lies within a specified range. 

Computation of confidence interval 

To compute confidence interval we require 
(i) the sample statistic, 

(ii) the standard error (S .E) of sampling distribution of the statistic 
(iii) the degree of accuracy reflected by the Z- value. 

If the size of sample is sufficiently large, then the sampling 
distribution is approximately normal. Therefore, the sample value 
can be used in estimation of standard error in the place of population 
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value. The Z-distribution is used in case of large samples to estimate 
the confidence limits. 



levels. 



We give below values of Z corresponding to some confidence 



Confidence Levels 


99% 


98% 


96% 


95% 


80% 


50% 


Value of Z, Z 

c 


2.58 


2.33 


2.05 


1.96 


1.28 


0.674 



(i) Confidence interval estimates for means 

Let (I and a be the population mean and standard deviation 
of the population. 

Let X and s be the sample mean and standard deviation of 
the sampling distribution of a statistic. 

The confidence limits for \l are given below : 



Population 
size 


Sample size 


confidence limits for (I. 


Infinite 


n 


X + (Z ) —7= , z is the value 
Vn 
of Z corresponding to 

confidence levels. 


Finite, N 


n 




W±^ 



(ii) Confidence intervals for proportions 

If p is the proportion of successes in a sample of size n 
drawn from a population with P as its proportion of successes, 
then the confidence intervals for P are given below : 



Population 


Sample size 


Confidence limits for P 


Infinite 


n 


I PQ 

p±< z c) iv 


Finite, N 


n 


. 


I PQ /N-n 

p ±(z cWvVn-i 
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Example 6 

Sensing the downward trend in demand for a leather 
product, the financial manager was considering shifting his 
company's resources to a new product area. He selected a 
sample of 10 firms in the leather industry and discovered their 
earnings (in %) on investment. Find point estimate of the 
mean and variance of the population from the data given 
below. 



21.0 25.0 20.0 

Solution : 



16.0 12.0 10.0 17.0 18.0 13.0 11.0 



X 


X 


XX 


(X-X) 2 


21.0 


16.3 


4.7 


22.09 


25.0 


16.3 


8.7 


75.69 


20.0 


16.3 


3.7 


13.69 


16.0 


16.3 


-0.3 


0.09 


12.0 


16.3 


-4.3 


18.49 


10.0 


16.3 


-6.3 


39.69 


17.0 


16.3 


0.7 


0.49 


18.0 


16.3 


1.7 


2.89 


13.0 


16.3 


-3.3 


10.89 


11.0 


16.3 


-5.3 


28.09 


163.0 






212.10 



c i v £X _ 163 
Sample mean, X = "TfT 



16.3 



Sample variance, s 2 



— ^E(X-X) 
212.10 



n-l 



Sample standard deviation = V23.5 
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= 23.5 (the sample size is small) 
4.85 



Thus the point estimate of mean and of variance of the 
population from which the samples are drawn are 16.3 and 23.5 
respectively. 

Example 7 

A sample of 100 students are drawn from a school. 
The mean weight and variance of the sample are 67.45 kg 
and 9 kg. respectively. Find (a) 95% and (b) 99% 
confidence intervals for estimating the mean weight of the 
students. 

Solution : 

Sample size, n = 100 
The sample mean, X= 67.45 
The sample variance s 2 = 9 
The sample standard deviation s = 3 
Let (I be the population mean. 

(a) The 95% confidence limits for (I are given by 

=> 67.45 ± (1.96) -j= (Here Z = 1.96 for 95% 

confidence level) 
=> 67.45 ±0.588 

Thus the 95% confidence intervals for estimating (I is given by 
(66.86, 68.04) 

(b) The 99% confidence limits for estimating (I are given by 

=> 67.45 ± (2.58) -7= (Here 2 = 2.58 for 99% 

v '" confidence level) 

=> 67.45 ±0.774 

Thus the 99% confidence interval for estimating \x is given by 

(66.67, 68.22) 
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Example 8 

A random sample of size 50 with mean 67.9 is drawn 
from a normal population. If it is known that the standard error 
of the sample mean is VOJ , find 95% confidence interval for 
the population mean. 

Solution : 

n = 50, sample mean X = 67.9 

95% confidence limits for population mean (I are : 

X±(Z){S.E(X)} 

=> 67.9 ± (1.96) (V07) 
=> 67.9 ±1.64 

Thus the 95% confidence intervals for estimating (I is given by 
(66.2, 69.54) 

Example 9 

A random sample of 500 apples was taken from large 
consignment and 45 of them were found to be bad. Find the 
limits at which the bad apples lie at 99% confidence level. 

Solution : 

We shall find confidence limits for the proportion of bad apples. 

Sample size n = 500 

Proportion of bad apples in the sample = — — =0.09 

p = 0.09 

.-. Proportion of good apples in the sample q = l-p = 0.91. 

The confidence limits for the population proportion P of bad 
apples are given by 

P ± (Z ) 




v 
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=> 0.09 + (2.58) / CggXO^l _> 0.09 + 0.033 
V 500 

The required interval is (0.057, 0.123) 

Thus, the bad apples in the consignment lie between 5.7% 
and 12.3% 

Example 10 

Out of 1000 TV viewers, 320 watched a particular 
programme. Find 95% confidence limits for TV viewers who 
watched this programme. 

Solution : 

Sample size n = 1000 

Sample proportion of TV viewers p = — = " ~ 



n 1000 




n 
= 0.0147 

The 95% confidence limits for population propotion P are 
given by 

p±(1.96) S.E (p) = 0.32 ±0.028 

=> 0.292 and 0.348 

.-. TV viewers of this programme lie between 29.2% and 
34.8% 

Example 11 

Out of 1500 school students, a sample of 150 selected 
at random to test the accuracy of solving a problem in business 
mathematics and of them 10 did a mistake. Find the limits 
within which the number of students who did the problem 
wrongly in whole universe of 1500 students at 99% confidence 
level. 
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Solution : 

Population size, 
Sample size, 

Sample proportion, 



Standard error of p, SE (p) = J^L = 0.02 

V n 
The 99% confidence limits for population proportion P are 
given by 



N = 


■- 1500 




n = 


:150 




P = 


10 

150 


= 0.07 


q = 


\-p = 


0.93 




N- 



0.07 + (2.58) (0.02) ^^If 
=> 0.07 ±0.048 
.-. The confidence interval for P is (0.022, 0.118) 

.-. The number of students who did the problem wrongly 
in the population of 1500 lies between .022 x 1500 = 33 and 
.118 x 1500= 177. 

EXERCISE 9.3 

1) A sample of five measurements of the diameter of a sphere 
were recorded by a scientist as 6.33, 6.37, 6.36, 6.32 and 6.37 
mm. Determine the point estimate of (a) mean, (b) variance. 

2) Measurements of the weights of a random sample of 200 ball 
bearings made by a certain machine during one week showed 
mean of 0.824 newtons and a standard deviation of 0.042 
newtons. Find (a) 95% and (b) 99% confidence limits for the 
mean weight of all the ball bearings. 

3) A random sample of 50 branches of State Bank of India out of 
200 branches in a district showed a mean annual profit of Rs.75 
lakhs and a standard deviation of 10 lakhs. Find the 95% 
confidence limits for the estimate of mean profit of 200 
branches. 
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4) A random sample of marks in mathematics secured by 50 
students out of 200 students showed a mean of 75 and a 
standard deviation of 10. Find the 95% confidence limits for 
the estimate of their mean marks. 

5) Out of 10000 customer's ledger accounts, a sample of 200 
accounts was taken to test the accuracy of posting and 
balancing wherein 35 mistakes were found. Find 95% 
confidence limits within which the number of defective cases 
can be expected to lie. 

6) A sample poll of 100 voters chosen at random from all voters 
in a given district indicated that 55% of them were in favour of 
a particular candidate. Find (a) 95% confidence limits, (b) 
99% confidence limits for the proportion of all voters in favour 
of this candidate. 

9.4 HYPOTHESIS TESTING 

There are many problems in which, besides estimating the 
value of a parameter of the population, we must decide whether a 
statement concerning a parameter is true or false; that is, we must 
test a hypothesis about a parameter. 

To illustrate the general concepts involved in this kind of 
decision problems, suppose that a consumer protection agency wants 
to test a manufacturer's claim that the average life time of electric 
bulbs produced by him is 200 hours. So it instructs a member of its 
staff to take 50 electric bulbs from the godown of the company and 
test them for their lifetime continuously with the intention of rejecting 
the claim if the mean life time of the bulbs is below 1 80 hours (say) ; 
otherwise it will accept the claim. 

Thus hypothesis is an assumption that we make about an 
unknown population parameter. We can collect sample data from 
the population, arrive at the sample statistic and then test if the 
hypothesis about the population parameter is true. 
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9.4.1 Null Hypothesis and Alternative Hypothesis 

In hypothesis testing, the statement of the hypothesis or 
assumed value of the population parameter is always stated before 
we begin taking the sample for analysis. 

A statistical statement about the population parameter assumed 
before taking the sample for possible rejection on the basis of 
outcome of sample data is known as a null hypothesis. 

The null hypothesis asserts that there is no diffeence between 
the sample statistic and population parameter and whatever 
difference is there is attributable to sampling error. 

A hypothesis is said to be alternative hypothesis when it is 
complementary to the null hypothesis. 

The null hypothesis and alternative hypothesis are usually 
denoted by f^ and Hj respectively. 

For example, if we want to test the null hypothesis that the 
average height of soldiers is 173 cms, then 

H :(i= 173=|i (say) 

Hj : n * 173 * IV 

9.4.2 Types of Error 

For testing the hypothesis, we take a sample from the 
population, and on the basis of the sample result obtained, we decide 
whether to accept or reject the hypothesis. 

Here, two types of errors are possible. A null hypothesis 
could be rejected when it is true. This is called Type I error and 
the probability of committing type I error is denoted by a. 

Alternatively, an error could result by accepting a null 
hypothesis when it is false. This is known as Type II error and the 
probability of committing type II error is denoted by (3 . 
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This is illustrated in the following table 



Actual 


Decision based 
on sampling 


Error and its 
Probability 


H is True 


Rejecting l\ 


Type I error ; 
a = P{H,/H } 


H is False 


Accepting Fi, 


Type II error ; 
p = P{H /H 1 } 



9.4.3 Critical region and level of significance 

A region in the sample space which amounts to rejection of 
null hypothesis (F^) is called the critical region. 

After formulating the null and alternative hypotheses about a 
population parameter, we take a sample from the population and 
calculate the value of the relevant statistic, and compare it with the 
hypothesised population parameter. 

After doing this, we have to decide the criteria for accepting 
or rejecting the null hypothesis. These criteria are given as a range 
of values in the form of an interval, say (a, b), so that if the statistic 
value falls outside the range, we reject the null hypothesis. 

If the statistic value falls within the interval (a, b), then we 
accept Fl,. This criterion has to be decided on the basis of the 
level of significance. A 5% level of signififance means that 5% of 
the statistical values arrived at from the samples will fall outside this 
range (a, b) and 95% of the values will be within the range (a, b). 

Thus the level of significance is the probability of Type I 
error a. The levels of significance usually employed in testing of 
hypothesis are 5% and 1%. 

A high significance level chosen for testing a hypothesis would 
imply that higher is the probability of rejecting a null hypothesis if it 
is true. 
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9.4.4 Test of significance 

The tests of significance are (a) Test of significance for large 
samples and (b) Test of significance for small samples. 

For larger sample size (>30), all the distributions like Binomial, 
Poission etc., are approximated by normal distribution. Thus normal 
probability curve can be used for testing of hypothesis. 

For the test statistic Z (standard normal variate), the critical 
region at 5% level is given by I Z l> 1.96 and hence the acceptance 
region is I Z I < 1.96. Where as the critical region for Z at 1% level 
is I Z I > 2.58 and the acceptance region is I Z I < 2.58. 

The testing hypothesis involves five steps : 

(i) The formulation of null hypothesis and an alternative hypothesis 

(ii) Set up suitable significance level. 

(iii) Setting up the statistical test criteria. 

(iv) Setting up rejection region for the null hypothesis. 

(v) Conclusion. 

Example 12 

The mean life time of 50 electric bulbs produced by a 
manufacturing company is estimated to be 825 hours with a 
standard deviation of 110 hours. If \L is the mean life time of 
all the bulbs produced by the company, test the hypothesis 
that \L = 900 hours at 5 % level of significance. 

Solution : 

Null Hypothesis, H : u, = 900 

Alternative Hypothesis, H { : (I ^ 900 

Test statistic , Z is the standard normal variate. 

X — it 

under Ei, , Z = — ^- where X is the sample mean 

J" a = s . d . of the population 
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X —LI 

= — j^- (For large sample, a = s) 

= 825^900 = _ 482 

■JJo 

.-. I Z 1=4.82 

Significant level, a = 0.05 or 5% 

Critical region is I Z I > 1.96 

Acceptance region is I Z I < 1.96 

The calculated Z is much greater than 1.96. 

Decision : Since the calculated value of I Z I = 4.82 falls in 
the critical region, the value of Z is significant at 5% level. 

.-. the null hypothesis is rejected. 

.-. we conclude that the mean life time of the population of 
electric bulbs cannot be taken as 900 hours. 

Example 13 

A company markets car tyres. Their lives are 
normally distributed with a mean of 50000 kilometers and 
standard deviation of 2000 kilometers. A test sample of 
64 tyres has a mean life of 51250 kms. Can you conclude 
that the sample mean differs significantly from the 
population mean? (Test at 5% level) 

Solution : 

Sample size, n = 64 

Sample mean, X= 51250 

H : population mean (i = 50000 

Hj : li * 50000 

X — LL 

Under fi,, the test statistic Z = — ^-~ N(0, 1) 
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51250-50000 

*-• — 2000 — J 

■Jm 
Since the calculated Z is much greater than 1.96, it is highly 
significant. 

H : (i = 50000 is rejected. 

The sample mean differs significantiy from the population mean 

Example 14 

A sample of 400 students is found to have a mean height 
of 171.38 cms. Can it reasonably be regarded as a sample 
from a large population with mean height of 171.17 cms 
and standard deviation of 3.3 cms. (Test at 5 % level) 

Solution : 

Sample size, n = 400 
Sample mean, X = 171.38 
Population mean, (1=171.17 
Sample standard deviation = s. 
Population standard deviation, a = 3.3 
Set H :(i= 171.38 

X — Lt 

The test statistic, Z = — ^-~ N (0, 1) 

X-li . , , . , 

= — — — since the sample is large, s = a 

171.38-171.17 



V400 

= 1.273 

Since I Z I = 1.273 < 1.96, we accept the null hypothesis at 
5% level of signifiance. 

Thus the sample of 400 has come from the population with 
mean height of 171.17 cms. 
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EXERCISE 9.4 

1) The mean I.Q of a sample of 1600 children was 99. Is it likely 
that this was a random sample from a population with mean 
I.Q 100 and standard deviation 15 ? (Test at 5% level of 
significance) 

2) The income distribution of the population of a village has a 
mean of Rs.6000 and a variance of Rs. 32400. Could a sample 
of 64 persons with a mean income of Rs.5950 belong to this 
population? 

(Test at both 5% and 1% levels of significance) 

3) The table below gives the total income in thousand rupees per 
year of 36 persons selected randomly from a particular class 
of people 

Income (thousands Rs.) 



6.5 


10.5 


12.7 


13.8 


13.2 


11.4 


5.5 


8.0 


9.6 


9.1 


9.0 


8.5 


4.8 


7.3 


8.4 


8.7 


7.3 


7.4 


5.6 


6.8 


6.9 


6.8 


6.1 


6.5 


4.0 


6.4 


6.4 


8.0 


6.6 


6.2 


4.7 


7.4 


8.0 


8.3 


7.6 


6.7 



On the basis of the sample data, can it be concluded that the 
mean income of a person in this class of people is Rs. 10,000 
per year? (Test at 5% level of significance) 

4) To test the conjecture of the management that 60 percent 
employees favour a new bonus scheme, a sample of 150 
employees was drawn and their opinion was taken whether 
they favoured it or not. Only 55 employees out of 150 favoured 
the new bonus scheme Test the conjecture at 1% level of 
significance. 

144 



EXERCISE 9.5 
Choose the correct answer 

1) The standard error of the sample mean is 
(a) Type I error (b) Type II error 

(c) Standard deviation of the sampling distribution of the mean 

(d) Variance of the sampling distribution of the mean 

2) If a random sample of size 64 is taken from a population 
whose standard deviation is equal to 32, then the standard 
error of the mean is 

(a) 0.5 (b) 2 (c) 4 (d) 32 

3) The central limit theorem states that the sampling distribution 
of the mean will approach normal distribution 

(a) As the size of the population increases 

(b) As the sample size increases and becomes larger 

(c) As the number of samples gets larger 

(d) As the sample size decreases 

4) The Z-value that is used to establish a 95% confidence interval 
for the estimation of a population parameter is 

(a) 1.28 (b) 1.65 (c) 1.96 (d) 2.58 

5) Probability of rejecting the null hypothesis when it is true is 
(a) Type I error (b) Type II error 

(c) Sampling error (d) Standard error 

6) Which of the following statements is true? 

(a) Point estimate gives a range of values 

(b) Sampling is done only to estimate a statistic 

(c) Sampling is done to estimate the population parameter 

(d) Sampling is not possible for an infinite population 

7) The number of ways in which one can select 2 customers out 
of 10 customers is 

(a) 90 (b) 60 (c) 45 (d) 50 
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10.1 LINEAR PROGRAMMING 

Linear programming is the general technique of optimum 
allocation of limited resources such as labour, material, machine, 
capital etc., to several competing activities such as products, services, 
jobs, projects, etc., on the basis of given criterion of optimality. 
The term limited here is used to describe the availability of scarce 
resources during planning period. The criterion of optimality 
generally means either performance, return on investment, utility, 
time, distance etc., The word linear stands for the proportional 
relationship of two or more variables in a model. Programming 
means 'planning' and refers to the process of determining a particular 
plan of action from amongst several alternatives. It is an extremely 
useful technique in the decision making process of the management. 

10.1.1 Structure of Linear Programming Problem (LPP) 

The LP model includes the following three basic elements. 

(i) Decision variables that we seek to determine. 

(ii) Objective (goal) that we aim to optimize (maximize or 
minimize) 

(iii) Constraints that we need to satisfy. 

10.1.2 Formulation of the Linear Programming Problem 

The procedure for mathematical formulation of a linear 
programming consists of the following major steps. 

Step 1 : Study the given situation to find the key decision to be 

made 
Step 2 : Identify the variables involved and designate them by 

symbols x(j = 1, 2 ...) 
Step 3 : Express the feasible alternatives mathematically in terms 

of variables, which generally are : x. > for all j 
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Step 4 : Identify the constraints in the problem and express them 
as linear inequalities or equations involving the decision 
variables. 

Step 5 : Identify the objective function and express it as a linear 
function of the decision variables. 

10.1.3 Applications of Linear programming 

Linear programming is used in many areas. Some of them are 

(i) Transport : It is used to prepare the distribution plan between 
source production and destination. 

(ii) Assignment : Allocation of the tasks to the persons available 
so as to get the maximum efficiency. 

(iii) Marketing : To find the shortest route for a salesman who 
has to visit different locations so as to minimize the total cost. 

(iv) Investment : Allocation of capital to differerent activities so 
as to maximize the return and minimize the risk. 

(v) Agriculture : The allotment of land to different groups so as 
to maximize the output. 

10.1.4 Some useful Definitions 

A feasible solution is a solution which satisfies all the 
constraints (including non-negativity) of the problem. 

A region which contains all feasible solutions is known as 
feasible region 

A feasible solution which optimizes (maximizes or minimizes) 
the objective function,is called optimal solution to the problem. 

Note 

Optimal solution need not be unique. 

Example 1 

A furniture manufacturing company plans to make 
two products, chairs and tables from its available 
resources, which consists of 400 board feet of mahogany 
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timber and 450 man-hours of labour. It knows that to make 
a chair requires 5 board feet and 10 man-hours and yields 
a profit of Rs.45, while each table uses 20 board feet and 
15 man - hours and has a profit of Rs.80. How many chairs 
and tables should the company make to get the maximum 
profit under the above resource constraints? Formulate the 
above as an LPP. 
Solution : 

Mathematical Formulation : 

The data of the problem is summarised below: 



Products 


Raw material 
(per unit) 


Labour 
(per unit) 


Profit 
(per unit) 


Chair 
Table 


5 
20 


10 

15 


Rs.45 
Rs. 80 


Total availability 


400 


450 





Step 1 : The key decision to be made is to determine the number 
of units of chairs and tables to be produced by the 
company. 

Step 2 : Let x l designate the number of chairs and x 2 designate 
the number of tables, which the company decides to 
produce. 

Step 3 : Since it is not possible to produce negative quantities, 
feasible alternatives are set of values of x x and x v such 
that Xj > and x 2 > 

Step 4 : The constraints are the limited availability of raw material 
and labour. One unit of chair requires 5 board feet of 
timber and one unit of table requires 20 board feet of 
timber. Since x 1 and x 2 are the quantities of chairs and 
tables, the total requirement of raw material will be 
5xj + 20x 2 , which should not exceed the available raw 
material of 400 board feet timber. So, the raw material 
constraint becomes, 

5^j + 20;t 2 < 400 
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Similarly, the labour constraint becomes, 

10*!+ 1 5* 2 < 450 

Step 5 : The objective is to maximize the total profit that the 
company gets out of selling their products, namely chairs, 
tables. This is given by the linear function. 

z= 45* t + 80* 2 . 

The linear programming problem can thus be put in the 
following mathematical form. 

maximize z = 45* { + 80* 2 



subject to 5*! + 20* 2 < 400 
10*!+ 1 5* 2 < 450 

*! > 0, * 2 > 

Example 2 

A firm manufactures headache pills in two sizes A and 
B. Size A contains 2 mgs. of aspirin. 5 mgs. of bicarbonate 
and 1 mg. of codeine. Size B contains 1 mg. of aspirin, 8 mgs. 
of bicarbonate and 6 mgs. of codeine. It is found by users 
that it requires atleast 12 mgs. of aspirin, 74 mgs. of 
bicarbonate and 24 mgs. of codeine for providing immediate 
relief. It is required to determine the least number of pills a 
patient should take to get immediate relief. Formulate the 
problem as a standard LPP. 

Solution : 



The data can 


be summarised 


as follows : 




Head ache 
pills 




Aspirin 


per pill 
Bicarbonate Codeine 


Size A 
SizeB 




2 
1 


5 
8 


1 
6 


Minimum requirement 


12 


74 


24 
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Decision variables : 

x 1 = number of pills in size A 
x 2 = number of pills in size B 

Following the steps as given in (10.1.2) the linear programming 
problem can be put in the following mathematical format : 

Maximize z = x { +x 2 
subject to 2jCj + x 2 > 12 

5xj + &x; 2 > 74 
x l + 6x 2 > 24 
x l >0, x 2 >0 

10.1.5 Graphical method 

Linear programming problem involving two decision variables 
can be solved by graphical method. The major steps involved in 
this method are as follows. 

Step 1 : State the problem mathematically. 

Step 2 : Plot a graph representing all the constraints of the problem 
and identify the feasible region (solution space). The 
feasible region is the intersection of all the regions 
represented by the constraints of the problem and is 
restricted to the first quadrant only. 

Step 3 : Compute the co-ordinates of all the corner points of the 
feasible region. 

Step 4 : Find out the value of the objective function at each comer 
point determined in step3. 

Step 5 : Select the corner point that optimizes (maximzes or 
minimizes) the value of the objective function. It gives the 
optimum feasible solution. 

Example 3 

A company manufactures two products P, and P 2 . The 
company has two types of machines A and B for processing 
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the above products. Product P 1 takes 2 hours on machine A 
and 4 hours on machine B, whereas product P 2 takes 5 hours 
on machine A and 2 hours on machine B. The profit realized 
on sale of one unit of product P x is Rs.3 and that of product P 2 
is Rs. 4. If machine A and B can operate 24 andl6 hours per 
day respectively, determine the weekly output for each 
product in order to maximize the profit, through graphical 
method. 

Solution : 

The data of the problem is summarized below. 



Product 


Hours 
Machine A 


; on 
Machine B 


profit 
(per unit) 


Pi 
P 2 


2 
5 




4 

2 


3 
4 


Max. hours / week 


120 




80 





Let Xj be the number of units of Pj and x 2 be the number of 
units of P 2 produced. Then the mathematical formulation of the 
problem is 

Maximize z = 3x l + 4x 2 

subject to 2x { + 5x 2 < 120 

4*j + 2x 2 <80 

Solution by graphical method 

Consider the equation 2xj + 5x 2 = 120, and 4x x + 2x 2 = 80. 
Clearly (0, 24) and (60, 0) are two points on the line 2x 1 + 5x 2 = 
120. By joining these two points we get the straight line 2x 1 + 5x 2 
= 120. Similarly, by joining the points (20, 0) and (0, 40) we get 
the sraight line 4x { + 2x 2 = 80. (Fig. 10.1) 
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Now all the constraints have been represented graphically. 



z /v 




80" 






70" 






60" 






50" 
40 J 


\ x 




30- 
B 1 

20" 

io- 


.V s 

1 — ^^ — 1 l> A 1 


H > 



o 



10 20 30 40 50 60 70 80 ' 
(Fig. 10.1) 

The area bounded by all the constrints called feasible region 
or solution space is as shown in the Fig. 10.1, by the shaded area 
OCMj B 

The optimum value of objective function occurs at one of the 
extreme (corner) points of the feasible region. The coordinates of 
the extreme points are 

= (0, 0), C = (20, 0), M 1 = (10, 20), B = (0, 24) 
We now compute the z- values correspoinding to extreme points. 



Extreme 
point 


coordinates 


z = 3x { + 4x 2 


O 
C 

B 


(0,0) 
(20, 0) 
(10, 20) 

(0, 24) 



60 
110 
96 



The optimum solution is that extreme point for which the 
objective function has the largest (maximum) value. Thus the 
optimum solution occurs at the point M { i.e. x 1 = 10 and x 2 = 20. 

Hence to maximize profit of Rs.110, the company should 
produce 10 units of P t and 20 units of P 2 per week. 
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Note 



In case of maximization problem, the comer point at which the 
objective function has a maximum value represent the optimal solution, 
mease of minimization problem, the comer point at which the objective 
function has a minimum value represents the optimnal solution. 

Example 4 

Solve graphically : 

Minimize z = 20jc x + 40jc 2 



Subject to 



36x l + 6x 2 > 108 



3x l 


+ 12x 2 _ 


>36 


20* 


l + 10jc 2 


>100 


•**!? 


x 2 >0 





Solution : 

A(0, 18) and B(3, 0) ; C(0, 3) and D(12, 0) ; E(0, 10) and 
F(5, 0) are the points on the lines 36*^ + 6x 2 = 108, 3x { + \2x 2 = 
36 and 20*; t + \0x 2 = 100 respectively. Draw thte above lines as 
Fig. 10.2. 



feasible region 




2 B 4 F 6 8 10 12° 14 16 "' 
(Fig. 10.2) 

Now all the constraints of the given problem have been 
graphed. The area beyond three lines represents the feasible region 
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or solution space, as shown in the above figure. Any point from 

this region would satisfy the constraints. 

The coordinates of the extreme points of the feasible region are: 
A = (0,18), M 2 = (2,6), M 1 = (4,2), D = (12, 0) 
Now we compute the z-values corresponding to extreme 

points. 



Extreme 


coordinates 


z= 20^ + 40*2 


point 


V^p X 2 ) 




A 


(0, 18) 


720 


M, 


(4,2) 


160 


M 2 


(2,6) 


280 


D 


(12,0) 


240 



The optimum solution is that extreme point for which the 
objective function has minimum value. Thus optimum solution 
occurs at the point Mj i.e. x 1 = 4 and x 2 = 2 with the objective 
function value of z = 160 .•. Minimum z = 160 at x l = 4, x 2 = 2 

Example 5 

Maximize z = x l + x 2 subject to x l + x 2 < 1 

4x 1 + 3x 2 > 12 



Solution : 




x 2 >0 



— h 

I" 2 3" 4 
(Fig. 10.3) 
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From the graph, we see that the given problem has no solutiion 
as the feasible region does not exist. 

EXERCISE 10.1 

1) A company produces two types of products say type A and B. 
Profits on the two types of product are Rs.30/- and Rs.40/- 
per kg. respectively. The data on resources required and 
availability of resources are given below. 





Requirements 


Capacity 

available 

per month 


Product A 


Product B 


Raw materials (kgs) 
Machining hours / piece 
Assembling (man hours) 


60 

8 

3 


120 

5 

4 


12000 
600 
500 



Formulate this problem as a linear programming problem to 
maximize the profit. 

2) A firm manufactures two products A & B on which the profits 
earned per unit are Rs.3 and Rs.4 respectively. Each product 
is processed on two machines M 1 and M,. Product A requires 
one minute of processing time on M l and two minutes on Mj, 
while B requires one minute on \^ and one minute on M,. 
Machine Mj is available for not more than 7 hrs 30 minutes 
while M, is available for 10 hrs during any working day. 
Formulate this problem as a linear programming problem to 
maximize the profit. 

3) Solve the following, using graphical method 
Maximize z - 45xj + 80x 2 

subject to the constraints 
5xj + 20x 2 < 400 
10xj+ 15x 2 <450 
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4) Solve the following, using graphical method 
Maximize z - 3xj + 4x 2 

subject to the constraints 

2xj + x 2 < 40 
2xj + 5x 2 < 180 
x p x 2 > 

5) Solve the following, using graphical method 
Minimize z - 3xj + 2x 2 

subject to the constraints 
5xj + x 2 > 10 
2xj + 2c 2 >12 
Xj + 4x 2 > 12 

10.2 CORRELATION AND REGRESSION 

10.2.1 Meaning of Correlation 

The term correlation refers to the degree of relationship 
between two or more variables. If a change in one variable effects 
a change in the other variable, the variables are said to be correlated. 
There are basically three types of correlation, namely positive 
correlation, negative correlation and zero correlatiion. 

Positive correlation 

If the values of two variables deviate (change) in the same 
direction i.e. if the increase (or decrease) in one variable results in 
a corresponding increase (or decrease) in the other, the correlation 
between them is said to be positive. 

Example 

(i) the heights and weights of individuals 
(ii) the income and expenditure 
(iii) experience and salary. 
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Negative Correlation 

If the values of the two variables constantly deviate (change) 
in the opposite directions i.e. if the increase (or decrease) in one 
results in corresponding decrease (or increase) in the other, the 
correlation between them is said to be negative. 

Example 

(i) price and demand , 

(ii) repayment period and EMI 

10.2.2 Scatter Diagram 

Let (x v Vj), (x v y 2 ) ... (x , y n ) be the n pairs of observation 
of the variables x and y. If we plot the values of x along x-axis 
and the corresponding values of y along y-axis, the diagram so 
obtained is called a scatter diagram. It gives us an idea of 
relationship between x and y. The types of scatter diagram under 
simple linear correlation are given below. 

Yf YA 



o 



Positive Correlation 
(Fig. 10.4) 

Yf 



'X 



O 



Negative Correlation 
(Fig. 10.5) 



•x 



O 



No Correlation 
(Fig. 10.6) 
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X 



(i) If the plotted points show an upward trend, the correlation 
will be positive (Fig. 10.4). 

(ii) If the plotted points show a downward trend, the correlation 
will be negative (Fig. 10.5). 

(iii) If the plotted point show no trend the variables are said to be 
uncorrelated (Fig. 10.6). 

10.2.3 Co-efficient of Correlation 

Karl pearson (1867-1936) a British Biometrician, developed 
the coefficient of correlation to express the degree of linear 
relationship between two variables. Correlation co-efficient between 
two random variables X and Y denoted by r(X, Y), is given by 

Cov(X, Y) 
r(X, Y) = SD(X) SD(Y) 

where 

Cov (X, Y) = — L(X ; -X)(Y. - Y) (co variance between X and Y) 

tl i 

SD (X) = a x = J-0(X/ -X) 2 (standard deviation of X) 
SD(Y) =G y = J-OY.-Y) 2 (standard deviationof Y) 

Hence the formula to compute Karl Pearson correlation 
co-efficient is 

ic^.-XXY-Y) 
r(X, Y) = 



,/!d(X,-X) 2 ,p-6(Y,.-Y) 2 

0(X,-X)(Y,-Y) ^ y 



^6(x,-x) 2 ^6(Y-Y) 2 ^/z7V^ 
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Note 

The following formula may also be used to compute correlation 
co-efficient between the two variables X and Y. 

nExy-ExEy 
(1) r(X ' Y) = a /nEx 2 -(Zx) 2 Va^y 2 -(Ey) 2 

NLdxdy - YAx YAy 
(ii) r(X,Y) = 



■sJNUdx 2 -(Zdxf^NZdy 2 -(Tdyf 



where dx = x - A ; dy = y - B are the deviations from 
arbitrary values A and B. 

10.2.4 Limits for Correlation co-efficient 

Correlation co-efficient lies between -1 and +1. 
i.e. -1 < r(x, y) < 1. 

(i) If r(X , Y) = +1 the variables X and Y are said to be perfectly 
possitively correlated. 

(ii) If r(X , Y) =-1 the variables X and Y are said to be perfectly 
negatively correlated. 

(in) If r(X , Y) = the variables X and Y are said to be 
uncorrelated. 

Example 6 

Calculate the correlation co-efficient for the following 
heights (in inches) of fathers(X) and their sons(Y). 

X : 65 66 67 67 68 69 70 72 

Y : 67 68 65 68 72 72 69 71 



Solution 



X = ^ = 544 = 68 
n 8 

Y = ZY = 5|2 = 69 
n 8 
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X Y *=X-X y=Y-Y x 2 y 2 xy 



65 


67 


-3 


-2 


9 


4 


6 


66 


68 


-2 


-1 


4 


1 


2 


67 


65 


-1 


-4 


1 


16 


4 


67 


68 


-1 


-1 


1 


1 


1 


68 


72 





3 





9 





69 


72 


1 


3 


1 


9 


3 


70 


69 


2 





4 








72 


71 


4 


2 


16 


4 


8 



544 552 36 44 24 

Karl Pearson Correlation Co-efficient, 

£jcy 24 . m 

fix, y) = i r-^ = = , — / — = .603 

a/z7VV a/36 V44 

Since fix, y) = .603, the variables X and Y are positively 
correlated, i.e. heights of fathers and their respective sons are said 
to be positively correlated. 

Example 7 

Calculate the correlation co-efficient from the data below: 

X:123456789 
Y : 9 8 10 12 11 13 14 16 15 

Solution : 



X 


Y 


X 2 


Y 2 


XY 


1 


9 


1 


81 


9 


2 


8 


4 


64 


16 


3 


10 


9 


100 


30 


4 


12 


16 


144 


48 


5 


11 


25 


121 


55 


6 


13 


36 


169 


78 


7 


14 


49 


196 


98 


8 


16 


64 


256 


128 


9 


15 


81 


225 


135 


45 


108 


285 


1356 


597 
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, vv , nZxy-ExZy 

r (X,Y) 



•JnEx 2 -(Ex) 2 Va^y 2 -(ZY) 2 
9(597) -(45) (108) 



.95 



•^9(285)- (45) 2 V9(1356)-(108) 2 
.-. X and Y are highly positively correlated. 

Example 8 

Calculate the correlation co-efficient for the ages of 
husbands (X) and their wives (Y) 

X : 23 27 28 29 30 31 33 35 36 39 

Y: 18 22 23 24 25 26 28 29 30 32 

Solution : 

Let A = 30 and B = 26 then dx = X-A dy = Y-B 



X 


Y 


d x 


d y 


d 2 

u X 


d 2 


d A 


23 


18 


-7 


-8 


49 


64 


56 


27 


22 


-3 


-4 


9 


16 


12 


28 


23 


-2 


-3 


4 


9 


6 


29 


24 


-1 


-2 


1 


4 


2 


30 


25 





-1 





1 





31 


26 


1 





1 








33 


28 


3 


2 


9 


4 


6 


35 


29 


5 


3 


25 


9 


15 


36 


30 


6 


4 


36 


16 


24 


39 


32 


9 


6 


81 


36 


54 






11 


-3 


215 


159 


175 








NZdxdy - 


-Ydx YAy 







r (*' y) - J/YZJ 2 -{Idxf^NIAl -(Ldy) 2 
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10(175) -(ll)(-3) 



1783 



1790.8 



0.99 



.-. X and Y are highly positively correlated, i.e. the ages of 
husbands and their wives have a high degree of correlation. 

Example 9 

Calculate the correlation co-efficient from the following 
data 



N = 25, 




EX = 125, EY = 100 


EX 2 = 650 




EY 2 = 436, EXY = 520 


Solution : 






We know, 










NEXY-OXOY 


v — 


VNEX 2 -(EX) 2 ^AEY 2 -(EY) 2 






25(520) -(125) (100) 



-^25(650) - (125) 2 ^25(436) - (100) 2 
r= -0.667 

10.2.5 Regression 

Sir Francis Galton (1822 - 1911), a British biometrician, 
defined regression in the context of heriditary characteristics. The 
literal meaning of the word "regression" is "Stepping back towards 
the average". 

Regression is a mathematical measure of the average 
relationship between two or more variables in terms of the original 
units of the data. 

There are two types of variables considered in regression 
analysis, namely dependent variable and independent variable(s). 
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10.2.6 Dependent Variable 

The variable whose value is to be predicted for a given 
independent variable(s) is called dependent variable, denoted by 
Y. For example, if advertising (X) and sales (Y) are correlated, we 
could estimate the expected sales (Y) for given advertising 
expenditure (X). So in this case Y is a dependent variable. 

10.2.7 Independent Variable 

The variable which is used for prediction is called an 
independent variable. For example, it is possible to estimate the 
required amount of expenditure (X) for attaining a given amount of 
sales (Y), when X and Y are correlated. So in this case Y is 
independent variable. There can be more than one independent 
variable in regression. 

The line of regression is the line which gives the best estimate 
to the value of one variable for any specific value of the other 
variable. 

Thus the line of Regression is the line of best fit and is 
obtained by the principle of least squares. (Refer pages 61 & 62 
of Chapter 7). The equation corresponds to the line of regression is 
also referred to as regression equation. 

10.2.8 Two Regression Lines 

For the pair of values of (X, Y), where X is an independent 
variable and Y is the dependent variable the line of regression of Y 
on X is given by 

Y- y=^(X-X) 
where b is the regression co-efficient of Y on X andgi yen 

by b = r . __L , where r is the correlation co-efficient between X 

X 

and Y and o and a are the standard deviations of X and Y 

x y 

respectively. 

.-. b = — — where x = X - X and y = Y - Y 

yx Ex 2 y 
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Similarly when Y is treated as an independent variable and 
X as dependent variable, the line of regression of X on Y is given by 

(X-X) = b xy (Y-Y) 

where b == r ^ =^EL herex = X-X ;v = Y-Y 
°> Zy 
Note 

The two regression equations are not reversible or 
interchangeable because of the simple reason that the basis and 
assumption for deriving these equations are quite different. 

Example 10 

Calculate the regression equation of X on Y from the 
following data. 

X : 10 12 13 12 16 15 

Y : 40 38 43 45 37 43 

Solution : 



X Y 


x=X-X 


v=Y- 


Y 


X 2 


f 


xy 


10 40 


-3 


-1 




9 


1 


3 


12 38 


-1 


-3 




1 


9 


3 


13 43 





2 







4 





12 45 


-1 


4 




1 


16 


-4 


16 37 


3 


-4 




9 


16 


-12 


15 43 


2 


2 




4 


4 


4 


78 246 










24 


50 


6 


_ EX 

x =ir = 


-^-13 
- 6 - 13 




Y = 


ZY 

n 


246 
- 6 


= 41 


Zxy 

b = „ 2 

*y Zy 


_-6 

50 ~" 


-0.12 










Regression 


equation of X on Y 


is (X- 


-x) = 


b (Y- 

xy v 


X-13 = 


-0.12 (Y 


-41): 


=> 


X= 17.92- 


0.1 2Y 
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Example 11 

Marks obtained by 10 students in Economics and 
Statistics are given below. 

Marks in Economics : 25 28 35 32 31 36 29 38 34 32 

Marks in Statistics : 43 46 49 41 36 32 31 30 33 39 

Find (i) the regression equation of Y on X 

(ii) estimate the marks in statistics when the marks 
in Economics is 30. 

Solution : 

Let the marks in Economics be denoted by X and statistics 
by Y. 



(i) 



X 


Y 


*=X-X y- 


=Y-y 


X 2 


f 


xy 


25 


43 


-7 


5 


49 


25 


-35 


28 


46 


-4 


8 


16 


64 


32 


35 


49 


3 


11 


9 


121 


33 


32 


41 





3 





9 





31 


36 


-1 


-2 


1 


4 


2 


36 


32 


4 


-6 


16 


36 


-24 


29 


31 


-3 


-7 


9 


49 


21 


38 


30 


6 


-8 


36 


64 


-48 


34 


33 


2 


-5 


4 


25 


-10 


32 


39 





1 





1 





320 


380 








140 


398 


-93 


X 


_ ex 

n 


320 ,« 
"10 




Y = 


ZY 380 
n 10 


= 38 


b 

yx 


Zxy 
~ Zx 2 


= 14 9 3 =-0.664 








Regression 


equation of Y on X 


is 








Y- 


Y =b (X- 

yx v 


-x) 










Y- 


38 = -0.664 (X - 32) 






=> 


Y = 59.25- 


- 0.664X 
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(ii) To estimate the marks in statistics (Y) for a given marks in the 
Economics (X), put X = 30, in the above equation we get, 

Y =59.25-0.664(30) 

= 59.25- 19.92 = 39.33 or 39 

Example 12 

Obtain the two regression equations for the following 
data. 

X : 4 5 6 8 11 

Y : 12 10 8 7 5 

Solution : 

The above values are small in magnitude. So the following 
formula may be used to compute the regression co-efficient. 





xy_ m:y 2 -(ey) 2 








b NZXY-OXOY 

yx m:x 2 -(ex) 2 






X 


Y x 2 


Y 2 


XY 


4 


12 16 


144 


48 


5 


10 25 


100 


50 


6 


8 36 


64 


48 


8 


7 64 


49 


56 


11 


5 121 


25 


55 


34 


42 262 


382 


-257 


X = 


^=^ = 6.8 Y 
n 5 


_£Y, 

n 


= f = 8, 


b XY 


5(257) - (34)(42) 
5(382) -(42) 2 


-0.98 




b YX 


5(257) - (34)(42) 
5(262) -(34) 2 


■0.93 





Regression Equation of X on Y is 
(X-X) = b XY (Y-Y) 
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=> X-6.8 = -0.98(Y-8.4) 

X= 15.03 -0.98Y 

Regression equation of Y on X is 

Y- Y =b YX (X-X) 

Y - 8.4 = -0.93 (X- 6.8) 
=> Y= 14.72 -0.93X 

EXERCISE 10.2 

1) Calculate the correlation co-efficient from the following data. 
X : 12 9 8 10 11 13 7 

Y : 14 8 6 9 11 12 3 

2) Find the co-efficient of correlation for the data given below. 
X : 10 12 18 24 23 27 

Y : 13 18 12 25 30 10 

3) From the data given below, find the correlation co-efficient. 
X : 46 54 56 56 58 60 62 

Y : 36 40 44 54 42 58 54 

4) For the data on price (in rupees) and demand (in tonnes) for a 
commodity, calculate the co-efficient of correlation. 

Price (X) : 22 24 26 28 30 32 34 36 38 40 
Demand(Y) : 60 58 58 50 48 48 48 42 36 32 

5) From the following data, compute the correlation co-efficient. 
N = ll, IX = 117, IY = 260, IX 2 = 1313 

IY 2 = 6580, IXY = 2827 

6) Obtain the two regression lines from the following 
X : 6 2 10 4 8 

Y : 9 11 5 8 7 

7) With the help of the regression equation for the data given 
below calculate the value of X when Y = 20. 

X : 10 12 13 17 18 

Y : 5 6 7 9 13 
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8) Price indices of cotton (X) and wool (Y) are given below for 
the 12 months of a year. Obtain the equations of lines of 
regression between the indices. 

X : 78 77 85 88 87 82 81 77 76 83 97 93 

Y : 84 82 82 85 89 90 88 92 83 89 98 99 

9) Find the two regression equations for the data given below. 
X : 40 38 35 42 30 

Y : 30 35 40 36 29 

10.3 TIME SERIES ANALYSIS 

Statistical data which relate to successive intervals or points 
of time, are referred to as "time series". 

The following are few examples of time series. 

(i) Quarterly production, Half-yearly production, and 
yearly production for particular commodity. 

(if) Amount of rainfall over 10 years period. 

(iii) Price of a commodity at different points of time. 

There is a strong notion that the term " time series" usually 
refer only to Economical data. But it equally applies to data arising 
in other natural and social sciences. Here the time sequence plays 
a vital role and it requires special techniques for its analysis. In 
analysis of time series, we analyse the past in order to understand 
the future better. 

10.3.1 Uses of analysis of Time Series 

(i) It helps to study the past conditions, assess the present 

achievements and to plan for the future, 
(ii) It gives reliable forcasts. 
(iii) It provides the facility for comparison. 

Thus wherever time related data is given in Economics, 
Business, Research and Planning, the analysis of time series provides 
the opportunity to study them in proper perspective. 
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10.3.2 Components of Time Series 

A graphical representation of a Time Series data, generally 
shows the changes (variations) over time. These changes are known 
as principal components of Time Series . They are 
(i) Secular trend (ii) Seasonal variation 

(iii) Cyclical variation (iv) Irregular variation. 

Secular trend (or Trend) 

It means the smooth, regular, long-term movement of a series 
if observed long enough. It is an upward or downward trend. It 
may increase or decrease over period of time. For example, time 
series relating to population, price, production, literacy ,etc. may 
show increasing trend and time series relating to birth rate, death 
rate, poverty may show decreasing trend. 

Seasonal Variation 

It is a short-term variation. It means a periodic movement in 
a time series where the period is not longer than one year. A periodic 
movement in a time series is one which recurs or repeats at regular 
intervals of time or periods. Following are the examples of seasonal 
variation. 

(i) passenger traffic during 24 hours of a day 

(ii) sales in a departmental stores during the seven days of a week. 

The factors which cause this type of variation are due to 
climatic changes of the different season, customs and habits of the 
people. For example more amount of ice creams will be sold in 
summer and more number of umbrellas will be sold during rainy 
seasons. 

Cyclical Variation 

It is also a short-term variation. It means the oscillatory 
movement in a time series, the period of oscillation being more than 
a year. One complete period is called a cycle. Business cycle is the 
suitable example for cyclical variation. There are many time series 
relating to Economics and Business, which have certain wave-like 
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movements called business cycle. The four phases in business cycle, 
(i) prosperity (ii) recession (iii) depression (iv) recovery, recur 
one after another regularly. 



(i) prosperity 



(i) prosperity 




(iii) depression 
(Fig. 10.7) 

Irregular Variation 

This type of variation does not follow any regularity. These 
variations are either totally unaccountable or caused by unforeseen 
events such as wars, floods, fire, strikes etc. Irregular variation is 
also called as Erratic Variation. 

10.3.3 Models 

In a given time series, some or all the four components, namely 
secular trend, seasonal variation, cyclical variation and irregular 
variation may be present. It is important to separate the different 
components of times series because either our interest may be on a 
particular component or we may want to study the series after 
eliminating the effect of a particular component. Though there exist 
many models, here we consider only two models. 

Multiplicative Model 

According to this model, it is assumed that there is a 
multiplicative relationship among the four components, i.e., 
y t = T t xS t xC t xI t , 

Where y is the value of the variable at time t, or observed 
data at time t, T is the Secular trend or trend, S is the Seasonal 
variation, C t is the Cyclical variation and I is the Irregular variation 
or Erratic variation. 

170 



Additive Model 

According to this model, it is assumed that y be the sum of 
the four components. 

y t = T t + S t + C t + I t , 

10.3.4 Measurement of secular trend 

The following are the four methods to estimate the secular 
trend 

(i) Graphic method or free - hand method 
(ii) Method of Semi - Averages 
(iii) Method of Moving Averages 
(iv) Method of least squares. 

(i) Graphic Method / Free - hand Method 

This is the simplest method of studying the trend procedure. 
Let us take time on the x - axis, and observed data on the y-axis. 
Mark a point on a graph sheet, corresponding to each pair of time 
and observed value. After marking all such possible points, draw a 
straight line which will best fit to the data according to personal 
judgement. 

It is to be noted that the line should be so drawn that it passes 
between the plotted points in such a manner that the fluctuations in 
one direction are approximately equal to those in other directions. 

When a trend line is fitted by the free hand method an attention 
should be paid to conform the following conditions. 

(i) The number of points above the line is equal to the 
number of points below the line, as far as possible. 

(ii) The sum of the vertical deviations from the trend of the 
annual observation above the trend should equal the 
sum of the vertical deviations from the trend of the 
observations below the trend. 

(iii) The sum of the squares of the vertical deviations of the 
observations from the trend should be as small as 
possible. 
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Example 13 

Fit a trend line to the following data by the free hand 



mehtod. 












year 

production of steel 


1978 
20 


1979 

22 


1980 
24 


1981 
21 


1982 
23 


year 

production of steel 


1983 

25 


1984 

23 


1985 
26 


1986 

25 





Solution 



27 

c 


Trend by the freehand method 


















<*^^ 


.2 25 

s 3 










^f«* 


2^*' 






c § 23 
o c 
•n o 
o H 




£~~ 
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o 
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1978 1979 1980 1981 1982 1983 1984 1985 1986 
Year 

(Fig. 10.8) 



Note 

(i) 



The trend line drawn by the free hand method can be extended 
to predict future values. However, since the free hand curve 
fitting is too subjective, this method should not be generally 
used for predictions 

(ii) In the above diagram false base line (zig-zag) has been used. 
Generally we use false base line following objectives. 

(a) Variations in the data are clearly shown 

(b) A large part of the graph is not wasted or space is saved. 

(c) The graph provides a better visual communications. 
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(ii) Method of Semi Averages 

This method involves very simple calculations and it is easy 
to adopt. When this method is used the given data is divided into 
two equal parts. For example, if we are given data from 1980 to 
1999, i.e., over a period of 20 years, the two equal points will be 
first 10 years, i.e. from 1980 to 1989, and from 1990 to 1999. In 
case of odd number of years like 7, 11, 13 etc., two equal parts can 
be made by omitting the middle year. For example, if the data are 
given for 7 years from 1980 to 1986, the two equal parts would be 
from 1980 to 1982 and from 1984 to 1986. The middle year 1983 
will be omitted. 

After dividing the data in two parts, find the arithmetic mean 
of each part. Thus we get semi averages from which we calculate 
the annual increase or decrease in the trend. 

Example 14 

Find trend values to the following data by the method 
of semi-averages. 



\ear 


1980 


1981 


1982 


1983 


1984 


1985 


1986 


Sales 


102 


105 


114 


110 


108 


116 


112 



Solution : 

No. of years = 7 (odd no.) By omitting the middle year 
(1983) we have 



Year 


Sales 


Semi total 


Semi-average 


1980 


102 






1981 


105 


321 


107 


1982 


114 






1983 


110 






1984 


108 






1985 


116 


336 


112 


1986 


112 







Difference between middle periods 
Difference between semi averages 
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1985-1981=4 
112- 107 = 5 



Annual increase in trend 



= 4=1.25 



Year 1980 1981 1982 1983 1984 1985 1986 
Trend 105.75 107 108.25 109.50 110.75 112 113.25 

Example 15 

The sales in tonnes of a commodity varied from 1994 
to 2001 as given below: 



\ear 
Sales 



\ear 
Sales 



1994 

270 



1995 
240 



1996 
230 



1997 
230 



1998 

220 



1999 
200 



2000 
210 



2001 
200 



Find the trend values by the method of semi-average. 
Estimate the sales in 2005. 

Solution : 



Year 


Sales 


Semi total Semi average 


1994 


270 




1995 
1996 


240 
230 > 


970 > 242.5 


1997 


230 




1998 


220 




1999 
2000 


200 
210 


830 > 207.5 


2001 


200 





Difference between middle periods 
Decrease in semi averages 

Annual decrease in trend 
Half yearly decrease in trend 
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= 1999.5- 1995.5 = 4 

= 242.5 - 207.5 = 35 

= f- = 8.75 
= 4.375 



Year 
Sales trend 


1994 1995 1996 1997 

255.625 246.875 238.125 229.375 


Year 
Sales trend 


1998 1999 2000 2001 
220.625 211.875 203.125 194.375 



Trend value for the year 2005 = 194.375 - (8.75 x 4) = 159.375 

(iii) Method of Moving Averages 

This method is simple and flexible algebraic method of 
measuring trend. The method of Moving Average is a simple device 
for eliminating fluctuations and obtaining trend values with a fair 
degree of accuracy. The technique of Moving Average is based on 
the arithmetic mean but with a distinction. In arithmetic mean we 
sum all the items and divide the sum by number of items, whereas in 
Moving Average method there are various averages in one series 
depending upon the number of years taken in a Moving Average. 
While applying this method, it is necessary to define a period for 
Moving Average such as 3 yearly moving average (odd number of 
years), 4-yearly moving average (even number of years) etc. 

Moving Average - Odd number of years (say 3 years) 

To find out the trend values by the method of 3-yearly moving 
averages, the following steps are taken into consideration. 

1) Add up the values of the first three years and place the yearly 
sum against the median year i.e. the 2nd year. 

2) Leave the first year item and add up the values of the next 
three years, i.e. from the 2nd year to 4 th year and place the 
sum (known as moving total) against the 3rd year. 

3) Leave the first two items and add the values of the next three 
years, i.e. from 3rd year to the 5th year and place the sum 
(moving total) against the 4th year. 

4) This process must be continued till the value of the last item is 
taken for calculating the moving average. 

5) Each 3-yearly moving total must be divided by 3 to get 
themoving averages. This is our required trend values. 
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Note 

The above 5 steps can be applied to get 5-years, 7-years, 9- 
years etc., Moving Averages. 

Moving Average - Even number of years (say 4 years) 

1) Add up the values of the first four years and place the sum 
against the middle of 2nd and 3rd years. 

2) Leave the first year value and add from the 2nd year onwards 
to the 5th year and write the sum (moving total) against the 
middle of the 3rd and the 4th items. 

3) Leave the first two year values and add the values of the next 
four years i.e. from the 3rd year to the 6th year. Place the 
sum (moving total) against the middle of the 4th and the 5th 
items. 

4) This process must be continued till the value of the last item is 
taken into account. 

5) Add the first two 4-years moving total and write the sum 
against 3rd year. 

6) Leave the first 4-year moving total and add the next two 4- 
year moving total. This sum must be placed against 4th year. 

7) This process must be continued till all the four- yearly moving 
totals are summed up and centred. 

8) Divide the 4-years moving total centred by 8 and write the 
quotient in a new column. These are our required trend values. 

Note 

The above steps can be applied to get 6-years, 8-years, 
10-years etc., Moving Averages. 

Example 16 

Calculate the 3-yearly Moving Averages of the 
production figures (in mat. tonnes) given below 
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\ear 
Production 



1973 1974 1975 1976 1977 1978 1979 1980 
15 21 30 36 42 46 50 56 



\ear 
Production 



1981 1982 1983 1984 1985 1986 1987 
63 70 74 82 90 05 102 



Solution : 

Calculation of 3-yearly Moving Averages 



Year 


Productiion 

y 


3-yearly 
Moving total 


3-yearly 
Moving average 


1973 


15 


--- 


--- 


1974 


21 


66 


22.00 


1975 


30 


87 


29.00 


1976 


36 


108 


36.00 


1977 


42 


124 


41.33 


1978 


46 


138 


46.00 


1979 


50 


152 


50.67 


1980 


56 


169 


56.33 


1981 


63 


189 


63.00 


1982 


70 


207 


69.00 


1983 


74 


226 


75.33 


1984 


82 


246 


82.00 


1985 


90 


267 


89.00 


1986 


95 


287 


95.67 


1987 


102 


--- 


--- 



Example 17 

Estimate the trend values using the data given below by 
taking 4-yearly Moving Average. 



\ear 
Value 


1974 
12 


1975 

25 


1976 
39 


1977 
54 


1978 
70 


1979 1980 
37 105 


\ear 
Value 


1981 
100 


1982 

82 


1983 
65 


1984 
49 


1985 
34 


1986 1987 
20 7 
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Solution : 



Year 


value 


4 year 


4 year 


Two 4-year 






moving 


moving total 


moving total 






total 


centered 


(Trend values) 


1974 


12 


--- 


--- 


1975 


25 -> 130 


--- 


--- 


1976 


39 -> 188 


318 


39.75 


1977 


54 -> 200 


388 


48.50 


1978 


70 ^266 


466 


58.25 


1979 


37 -> 312 


578 


72.25 


1980 


105 _^ 324 


636 


79.50 


1981 


ioo ^ 352 


676 


84.50 


1982 


82 -> 296 


648 


81.00 


1983 


65 -> 230 


526 


65.75 


1984 


49 ^ 168 


398 


49.75 


1985 


34 -> 110 


278 


34.75 


1986 


20 


--- 


--- 


1987 


7 


--- 


--- 



(iv) Method of Least Squares 

The method of least squares is most widely used in practice. 
The method of least squares may be used to fit a straight line trend. 

The straight line trend is generally expressed by an equation 

y t = a + bx 

Where y is used to represent the trend values, 'a' is the 
intercept, 'b' represents slope of the line which is also known as 
the ratio of growth during a unit of time. The variable x represents 
the time periods. 
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In order to determine the values of the unknown constants 
'a' and V the following equations, known as normal equations, 
are used. 

Ey = na + bZx 

Zxy = dLx + bZx 2 , 

where n represents number of observations (years, months 
or any other period) for which the data are given. 

For derivation of normal equations, refer pages 61 & 62 of 
Chapter 7. To solve the above normal equations and get trend 
values the following are the computational steps. 

Case (i) When the number of years is odd 

1) Denote the years as the X variable and its corresponding 
values as Y. 

2) Assume the middle year as the period of origin and take 
deviations accordingly. Thus EX = 0. 

3) Find EX 2 , EY 2 and EXY. 

4) Substitute EX, EX 2 , EY and EXY in the above normal 
equation and the solve it. 



EX , _ EXY 

n b ~ EX 2 



Hence a = b - ^ r2 



5) Put the values of 'a' and V in the equation and solving for 
each value of X, we get the trend values. 

Case (ii) When the number of years is even 

In this case, assume the variable X as 
_ x - A.M. of two Middle years 

and all other steps are similar to case (i) 
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Note 

If the time lag between consecutive years is not one assume 
the variable X as 



X = 



x - A.M. of two Middle years 



where d 



Difference between tw o consecutiv e years 



Example 18 

Below are the given the figures of production (in 
thousand tonnes) of a sugar factory. 



\ear 


1980 


1981 


1982 1983 1984 


1985 


1986 


Production 


80 


90 


92 83 94 


99 


92 



Fit a straight line trend to these figures by the method 
of least squares and estimate the production in the year of 
1990. 

Solution : 

Given the numbers of years n = l (odd) 



Year 

X 


Production 
('000 tonnes) 
y = Y 


X=*-1983 


X 2 


XY 


1980 


80 


-3 


9 


-240 


1981 


90 


-2 


4 


-180 


1982 


92 


-1 


1 


-92 


1983 


83 











1984 


94 


1 


1 


94 


1985 


99 


2 


4 


198 


1986 


92 


3 


9 


276 




630 





28 


56 
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The equation of the straight line trend is Y t = a + bX 

Substituting the values of EX, EX 2 , EXY and n in normal 
equation, we get 

630 = la + b(0) => a = 90 
56 = a(0) + 6(28) => & = 2 

Hence the equation of the straight line trend is 

Y, = 90 + 2X 
Trend values 

For X = -3, Y t = 90 + 2(-3) = 84 

For X = -2, Y t = 90 + 2(-2) = 86 

For X = -2, Y r = 90 + 2(-l) = 88 

For X = 0, Y t = 90 + 2(0) =90 

For X= 1, Y r = 90 + 2(1) =92 

For X = 2, Y r = 90 + 2(2) = 94 

For X = 3, Y r = 90 + 2(3) =96 

To estimate the production in 1990, substitute X = 7 in the 
trend equation. 

.-. Y 1990 = 90 + 2(7) = 104 x 1000 tonnes 

Example 19 

Fit a straight line trend by the method of least squares 
to the following data. Also predict the earnings for the year 
1988. 



\ear 


1979 1980 1981 1982 1983 1984 1985 1986 


Earnings 


38 40 65 72 69 60 87 95 



Solution : 

Number of years n = 8 (even) 
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year 

X 


Earnings 
(in lakhs) 
v=Y 


v jc-1982.5 
X " .5 


X 2 


XY 


1979 


38 


-7 


49 


-266 


1980 


40 


-5 


25 


-200 


1981 


65 


-3 


9 


-195 


1982 


72 


-1 


1 


-72 


1983 


69 


1 


1 


69 


1984 


60 


3 


9 


180 


1985 


87 


5 


25 


435 


1986 


95 


7 


49 


665 




526 





168 


616 



The equation of the straight line trend is Y t = a + bX 

Substituting the values of EX, EX 2 , EXY, n in Normal 
equation, we get, 

526 = 8a + 6(0) (1) 

616 = a(0) + £(168) (2) 

Solving (1) and (2) we get a = 65.75 and b = 3.667 

Hence the equation of the straight line trend is 

Y 7 = 65.75 + 3.667X 

Year 1979 1980 1981 1982 

Trend values 40.08 47.415 54.749 62.083 

Year 1983 1984 1985 1986 

Trend values 69.417 76.751 84.085 91.419 

To estimate the earnings in 1988, substitute X = 11 in the 
trend equation and we get, 

Y t = 65.75 + 3.667(1 1) = 106.087 

The estimate earnings for the year 1988 are Rs. 106.087 lakhs. 
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Measurement of seasonal variation 

Seasonal variation can be measured by the method of simple 
average. 

Method of simple average 

This method is the simple method of obtaining a seasonal 
Index. In this method the following steps are essential to calculate 
the Index. 

(i) Arrange the data by years, month or quarters as the 
case may be. 

(ii) Compute the totals of each month or each quarter. 

(iii) Divide each total by the number of years for which the 
data are given. This gives seasonal averages (monthly 
or quarterly) 

(iv) Compute average of seasonal averages. This is called 
grand average. 

(v) Seasonal Index for every season (monthly or quarterly) 
is calculated as follows 

Seasonal Average 
Seasonal Index (S.I) = Grand Average x 100 

Note 

(i) If the data is given monthwise, 

Monthly Average 
Seasonal Index = Grand Average x 100 

(ii) If quarterly data is given, 

Quarterly Average 
Seasonal Index = Grand Average x 100 

Example 20 

From the data given below calculate Seasonal Indices. 

183 



year 
Quarter 1984 1985 1986 1987 1988 



40 


42 


41 


45 


44 


35 


37 


35 


36 


38 


38 


39 


38 


36 


38 


40 


38 


40 


41 


42 



Solution 



Quarters 
year I II III IV 



1984 


40 


35 


38 


40 


1985 


42 


37 


39 


38 


1986 


41 


35 


38 


40 


1987 


45 


36 


36 


41 


1988 


44 


38 


38 


42 



Total 212 181 189 201 

Average 42.4 36.2 37.8 40.2 

n A A 42.4+ 36.2 + 37.8 +40.2 ~ 01< 
Grand Average = -r =39.15 

Quarterly Average 
Seasonal Index (S. I) = Grand Average x 100 

Hence, S.I for I Quarter = ^^ x 100 = 108.30 



S.I for II Quarter = ^j= x 100 = 92.54 
S.I for III Quarter = ^^ x 100 = 96.55 
S.I for IV Quarter = ^^ x 100 = 102.68 



Note 

Measurement of cyclical variation, and measurement of 
irregular variation is beyond the scope of this book. 
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EXERCISE 10.3 

1) Draw a trend line by graphic method (free hand) 

year 1995 1996 1997 1998 1999 2000 2001 

Production 20 22 25 26 25 27 30 

2) Draw a trend line by graphic method 

year 1997 1998 1999 2000 2001 

Production 20 24 25 38 60 

3) Obtain the trend values by the method of Semi- Average 

year 1987 1988 1989 1990 1991 1992 1993 

Production 90 110 130 150 100 150 200 
(in tonnes) 

year 1993 1994 1995 1996 1997 1998 1999 2000 

Netprofit 38 39 41 43 40 39 35 25 
(Re lakhs) 

5) Using three year moving averages determine the trend values 
for the following data. 

year 1983 1984 1985 1986 1987 1988 1989 

Production 21 22 23 25 24 22 25 
(in tonnes) 

year 1990 1991 1992 

Production 26 27 26 
(in tonnes) 

6) Below are given figures of production (in thousand tonnes) of 
a sugar factory. Obtain the trend values be 3-year moving 
average. 

year 1980 1981 1982 1983 1984 1985 1986 

Production 80 90 92 83 94 99 92 

7) Using four yearly moving averages calculate the trend values. 

year 1981 1982 1983 1984 1985 1986 1987 

467 502 540 557 



oducti 


on 464 


515 


518 


ar 


1988 


1989 


1990 


oducti 


on 571 


586 


612 
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Calculate the trend values by four year moving averages 
method. 



1978 


1979 


1980 


1981 


1982 


1983 


1984 


ion 614 


615 


652 


678 


681 


655 


717 


1985 


1986 


1987 


1988 








ion 719 


708 


779 


757 









9) Given below are the figures of production of a sugar factory, 
year 1989 1990 1991 1992 1993 1994 1995 

Production 77 88 94 85 91 98 90 

(in tonnes) 

Fit a straight line trend to these figures by the method of least 
squares and estimate trend values. Also estimate the production 
for the year 2000. 

10) Fit the straight line trend, find the trend values and estimate 
the net profit in 2002. 

Year 1992 1993 1994 1995 1996 1997 1998 

Net profit 65 68 59 55 50 52 54 

year 1999 2000 

Net profit 50 42 

1 1) The following data relate to the profit earned by public limited 
company from 1984 to 1989. 

Year 1984 1985 1986 1987 1988 1989 

Profit 10 12 15 16 18 19 

(Rs in 000) 

Fit a straight line trend by the method of least squares to the 
data and tabulate the trend values. 

12) Fit a straight line trend and estimate the trend values. 
Year 1992 1993 1994 1995 1996 1997 1998 1999 
Net profit 47 53 50 46 41 39 40 36 
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13) 



14) 



15) 



Calculate the seasonal indices by the method of simple average 
for the following data 



year 


I quarter 


II quarter 


III quarter 


IV quarter 


1985 
1986 
1987 


68 
65 
68 


62 

58 
63 


61 
66 
63 


63 
61 
67 



Calculate the seasonal indices for the following data by the 
method of simple Average. 







Quarters 






year 


I 


II 


III 


IV 


1994 


78 


66 


84 


80 


1995 


76 


74 


82 


78 


1996 


72 


68 


80 


70 


1997 


74 


70 


84 


74 


1998 


76 


74 


86 


82 



Calculate the seasonal Indices for the following data using 
average method. 







Quarters 






year 


I 


II 


III 


IV 


1982 


72 


68 


80 


70 


1983 


76 


70 


82 


74 


1984 


74 


66 


84 


80 


1985 


76 


74 


84 


78 


1986 


78 


74 


86 


82 



10.4 INDEX NUMBERS 

"An Index Number is a single ratio (usually in percentages) 
which measures the (combined average) change of several variables 
between two different times, places and situations" - Alva. M. Tuttle. 

Index numbers are the devices for measuring differences in 
the magnitude of a group of related variables, over two different 
situations or defined as a measure of the average change in a group 
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of related variables over two different situations. For example, the 
price of commodities at two different places or two different time 
periods at the same location. We need Index Numbers to compare 
the cost of living at different times or in different locations. 

10.4.1 Classification of Index Numbers 

Index Numbers may be classified in terms of what they 
measure. They are 

(i) Price Index Numbers 

(ii) Quantity Index Numbers 

(iii) Value Index Numbers 

(iv) Special purpose Index Numbers 

We shall discuss (i) and (ii). 

10.4.2 Uses of Index Numbers 

(i) Index numbers are used to evolve business policies. 

(ii) Index numbers determine the inflation or deflation in economy. 

(iii) Index numbers are used to compare intelligence of students 
in different locations or for different year. 

(iv) Index numbers serve as economic barometers. 

10.4.3 Method of construction of Index Numbers 

Index Numbers are broadly divided into two groups 
(i) Unweighted Index 
(ii) Weighted Index 

We confine our attention to weighted index numbedrs. 

10.4.4 Weighted Index Numbers 

The method of construction of weighted indices are 

(c) Weighted aggregative method 

(d) Weighted averages of relatives method 
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Weighted Aggregative Index Numbers 

Let p x and/? be the prices of the current year and the base 
year respctively . Let q x and q Q be the quantities of the current year 
and the base year respectively. The formulae for assigning weights 
to the items are : 
(i) Laspeyre's Price Index 

P 01 = y x 100 where w = p q is the weight 

assigned to the items and P is the price index. 

(ii) Paasche's price index 



p p = 



?Po<li 



xlOO 



Here the weights assigned to the items are the current year 
quantities i.e. W =p q x 

(iii) Fisher's price Index 

K = fi^W, = J^x|™ x 100 

Note 

Fisher's price index is the geometric mean of Laspeyre's and 
Paasche's price index numbers. 

Example 21 

Compute (i) Laspeyre's (ii) Paasche's and (iii) Fisher's 
Index Numbers for the 2000 from the following : 



Commodity 


Price 
1990 2000 


Quantity 
1990 2000 


A 


2 


4 


8 


6 


B 


5 


6 


10 


5 


C 


4 


5 


14 


10 


D 


2 


2 


19 


13 
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Solution : 














Commodity 


Price Quantity 
Base Current Base Current 












year year 
Po Pi 


year year 
% % 


Po% 


P^o 


p q t 


PA 


A 


2 4 


8 6 


16 


32 


12 


24 


B 


5 6 


10 5 


50 


60 


25 


30 


C 


4 5 


14 10 


56 


70 


40 


50 


D 


2 2 


19 13 


38 


38 


26 


26 



160 200 103 130 



(i) Laspeyre's Index : P^ = ^ x 100 



200 
160 



xl00= 125 



(ii) Paasche's Index : P„ p , = = x 

01 ?Po<li 



100 



130 
103 



xl00= 126.21 



Cm) Fisher's Index : P^ = ^ x P P , 

= 125.6 
Example 22 

From the following data, calculate price index number 
by (a) Laspeyre's method (b) Paasche's method (iii) Fisher's 
method. 



Commodity 


Base 
Price 


year 
Quantity 


Current year 
Price Quantity 


A 


2 


40 


6 


50 


B 


4 


50 


8 


40 


C 


6 


20 


9 


30 


D 


8 


10 


6 


20 


E 


10 


10 


5 


20 
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Solution : 

Commodity Base year Current year 

Price Qty Price Qty 

p q Pi qi 



p q Piq p qi pa 



A 


2 


40 


6 


B 


4 


50 


8 


C 


6 


20 


9 


D 


8 


10 


6 


E 


10 


10 


5 



50 80 240 100 300 

40 200 400 160 320 

30 120 180 180 270 

20 80 60 160 120 

20 100 50 200 100 



580 930 800 1110 



(i) Laspeyre's Price Index : P 01 = =, x 100 



930 
580 



xl00= 160.34 



^Pili 
(ii) Paasche's Price Index : P., = =; x 100 

01 ZPoQi 



1100 
800 



x 100 =137.50 



(iii) Fisher's Index : P^ = ^ x P P , = 148.48 

10.4.5 Test of adequacy for Index Number 

Index Numbers are constructed to study the relative changes 
in prices, quantities, etc. of one time in comparison with another. 
Several formulae have been suggested for constructing index 
numbers and one should select the most appropriate one in a given 
situation. Following are the tests suggested for choosing an 
appropriate index. 

1) Time reversal test 

2) Factor reversal test 
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Time reversal test 

It is a test to determine whether a given method will work 
both ways in time, forward and backward. When the data for any 
two years are treated by the same method, but with the bases 
reversed, the two index numbers secured should be reciprocals of 
each other so that their product is unity. Symbolically the following 
relation should be satisfied. 

• P xP =1 

• • r 01 A r 10 ± 

(ignoring the factor 100 in each index) where P 01 is the index for 
current period 1 on base period and P 10 is the index for the current 
period on base period 1. 

Factor reversal test 

This test holds that the product of a price index and the 
quantity index should be equal to the corresponding value index. 
The test is that the change in price multiplied by the change in quantity 
should be equal to the total change in value. 



Poi x Qoi = rw, „ (ignoring the factor 100 in each index) 



Op q 

P 01 gives the relative change in price and Q (jl gives the 
relative change in quantity. The total value of a given commodity 
in a given year is the product of the quantity and the price per 
unit. 

^ is the ratio of the total value in the current period to 

^p q 

the total value in the base period and this ratio is called the true 
value ratio. 

Fisher's index is known as Ideal Index Number since it is the 
only index number that satisfies both reversal tests. 
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Example 23 

Calculate Fisher's Ideal Index from the following data 
and verify that it satisfies both Time Reversal and Factor 
Reversal test 



Commodity 


Price 
1985 1986 


Quantity 
1985 1986 


A 


8 


20 


50 


60 


B 


2 


6 


15 


10 


C 


1 


2 


20 


25 


D 


2 


5 


10 


8 


E 


1 


5 


40 


30 



Solution 



Commodity 1985 
Po % 



1986 
Pi Qi 



Pi% p q pa p qi 



A 


8 


50 


20 


B 


2 


15 


6 


C 


1 


20 


2 


D 


2 


10 


5 


E 


1 


40 


5 



60 1000 400 1200 480 

10 90 30 60 20 

25 40 20 50 25 

8 50 20 40 16 

30 200 40 150 30 



1380 510 1500 571 



Fisher's Ideal Index 



'f^xf™ xlOO 



cm 



1380 x 1500 xl0Q 
V 510 571 

= 2.6661 x 100 = 266.61 



Time reversal test 

Test is satisfied when P 01 x P 10 = 1 
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01 



10 



' E A?o. x L A?i 



S Po4o S A)4i 



z Piqi z Piqi 



1380 1500 
510 571 



571 v 510 

A 



1500 1380 



P 01 xP 



10 



[1380 v 1500 V J571 510 

"A _ „ A , _ „ „ A ~ 



510 571 1500 1380 
t = 1 
Hence Fisher's Ideal Index satisfies Time reversal test. 

Factor reversal test 



Test is satisfied when P„. xQ m = 



-01 



E Atfo 



Q 



01 



'%Po. x E ?iA 



Zq P ^q Q p 



OFl 



571 1500 
510 1380 



P~xQ = 



01 



11380 x 1500 x 571 x 1500 



V 510 
1500 



571 



510 1380 



510 Zp q 

Hence Fisher's Ideal Index satisfies Factor reversal test. 

Example 24 

Compute Index Number using Fisher's formula and show 
that it satisfies time reversal test and factor reversal test. 



Commodity 


Base year 
Price Quantity 


Current year 
Price Quantity 


A 


10 


12 


12 


15 


B 


7 


15 


5 


20 


C 


5 


24 


9 


20 


D 


16 


5 


14 


5 
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Solution : 


















Commodity 


Base 

year 


Current 
year 












Po 


% 


Pi 


q. 


Piq 


p q 


PA 


Pctfi 


A 


10 


12 


12 


15 


144 


120 


180 


150 


B 


7 


15 


5 


20 


75 


105 


100 


140 


C 


5 


24 


9 


20 


216 


120 


180 


100 


D 


16 


5 


14 


5 


70 


80 


70 


80 



505 425 530 470 



Fisher's Ideal Index = Jzr^-x -^- x 100 



5Q5 x-^- x 100= 115.75 

425 470 



Time reversal test 

Test is satisfied when P 01 x P 10 = 1 



01 



ZPoVo £/V7i V425 470 



p = pPi^o x z Affi = /505 x 530 



p = fegi x £ Po^o = / 470 x 425 



10 



EjOj^j Sp^o V530 505 



P„,xP,„ = J^x53gx4??x 425 



oi 10 -y 425 "470 "530 "505 
= VT= 1 
Hence Fisher's Ideal Index satisfies Time Reversal Test. 

Factor reversal test 

Test is satisfied when P„, xO„, = — 

01 01 Zp q 
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q _ / ggiPo x ggiPi = / 470 x 425 



01 ^ 0jPo E ?oP] V530 505 



P xO - 505 x 530 .. 470 530 
01 ^ ^' 425 470 425 505 



530 Z PA 



425 Sp ^ 
Hence Fisher's Ideal Index satisfies Factor reversal test. 

10.4.6 Cost of Living Index (CLI) 

Cost of living index numbers are generally designed to 
represent the average change over time in the prices paid by 
the ultimate consumer for a specified quantity of goods and 
services. Cost of living index number is also known as 
Consumer price index number 

It is well known that a given change in the level of prices 
(retail) affects the cost of living of different classes of people in 
different manners. The general index number fails to reveal this. 
Therefore it is essential to construct a cost of living index number 
which helps us in determining the effect of rise and fall in prices on 
different classes of consumers living in different areas. It is to be 
noted that the demand for a higher wage is based on the cost of 
living index. The wages and salaries in most countries are adjusted 
in accordance with the cost of living index. 

10.4.7 Methods of constructing cost of living index 

Cost of living index may be constructed by the following 
methods. 

(i) Aggregate expenditure method or weighted aggregative 
method 

(ii) Family budget method 
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Aggregate expenditure method 

In this method, the quantities of commodities consumed by 
the particular group in the base year are used as the weights. On 
the basis of these weights, aggregate (total) expenditure in current 
year and base year are calculated and the percentage of change is 
worked out . 

Cost of Living Index (C.L.I) = ^^ x 100 

This method is the most popular method for constructing cost 
of living index and the method is same as Laspeyre's price index. 

Family budget method 

In this method, the value weights obtained by multiplying 
prices by quantities consumed (i.e. p Q q Q ) are taken as weights. To 
get the cost of living index, find the sum of respective products of 
price relatives and value weights and then divide this sum by the 
sum of the value weights. 

ypy 
Cost of living Index = where 

Pi 
P = — x 100 is the price relative and 

Po 
V = p Q q Q is the value weight for each item. 

This method is same as the Weighted average of price relative 
method. 

10.4.8 Uses of cost of living index number 

(i) The cost of living index number is mainly used in wage 
negotiations and wage contracts. 

(ii) It is used to calculate the dearness allowance for the 
employees. 
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Example 25 

Calculate the cost of living index by aggregate 
expenditure method 



Commodity 


Quantity 
2000 


Price (Rs) 


2000 


2003 


A 


100 


8 


12.00 


B 


25 


6 


7.50 


C 


10 


5 


5.25 


D 


20 


48 


52.00 


E 


65 


15 


16.50 


F 


30 


19 


27.00 



Solution 



Commodity Quantity Price 

2000 2000 2003 
4o Po Pi 



PiVo 



Po% 



A 


100 


8 


12.00 


1200.00 


800 


B 


25 


6 


7.50 


187.50 


150 


C 


10 


5 


5.25 


52.50 


50 


D 


20 


48 


52.00 


1040.00 


960 


E 


65 


15 


16.50 


1072.50 


975 


F 


30 


19 


27.00 


810.00 


570 



4362.50 3505 



C.L.I = 



^PSo 

4362.50 
3505 



xlOO 



x 100 = 124.46 



198 



Example 26 

Construct the cost of living Index Number for 2003 on 
the basis of 2000 from the following data using family Budget 
method. 



Items 


Price 


Weights 


2000 


2003 


Food 


200 


280 


30 


Rent 


100 


200 


20 


Clothing 


150 


120 


20 


Fuel & lighting 


50 


100 


10 


Miscellaneous 


100 


200 


20 



Solution : 

Calculation of CLI by family budget method 



Items 



weights P=— xlOO PV 
Pa 
V 



Food 


200 


280 


30 


140 


4200 


Rent 


100 


200 


20 


200 


4000 


Clothing 


150 


120 


20 


80 


1600 


Fuel & Lighting 50 


100 


10 


200 


2000 


Misc. 


100 


200 


20 


200 


4000 



100 



15800 



Cost of living index = ^- = ±|2QQ = 158 

Hence, there is 58% increase in cost of living in 1986 
compared to 1980. 
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EXERCISE 10.4 



1) 



2) 



3) 



4) 



Compute (i) Laspeyre's (ii) Paasche's and (iii) Fisher's index 
Numbers 



Commodity 


Price 


Quantity 


Base 


Current 


Base 


Current 




year 


year 


year 


year 


A 


6 


10 


50 


50 


B 


2 


2 


100 


120 


C 


4 


6 


60 


60 


D 


10 


12 


30 


25 



Construct the price index number from the following data by 

applying 

(i) Laspeyre's (ii) Paasche's (iii) Fisher's method 



Commodity 


1999 


1998 


Price 


Quantity 


Price 


Quantity 


A 
B 
C 
D 


4 
6 

5 

2 


6 
5 

10 
13 


2 
5 
4 

2 


8 

10 
14 
19 



Compute (a) Laspyre's (b) Paasche's (c) Fisher's method 
of index numbers for 1990 from the following : 



Commodity 


Price 


Quantity 


1980 


1990 


1980 


1990 


A 
B 
C 
D 


2 
5 
4 

2 


4 
6 

5 

2 


8 

10 
14 
19 


6 
5 

10 
13 



From the following data calculate the price index number by 
(a) Laspeyre's method (b) paasche's method 
(c) Fisher's method 
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5) 



6) 



Commodity 


Base 


year 


Current year 


Price 


Quantity 


Price 


Quantity 


A 


5 


25 


6 


30 


B 


10 


5 


15 


4 


C 


3 


40 


2 


50 


D 


6 


30 


8 


35 



Using the following data, construct Fisher's Ideal index and 
show that it satisfies Factor Reversal test and Time Reversal 
test. 



Commodity 


Price 


Quantity 


Base 


Current 


Base 


Current 




year 


year 


year 


year 


A 


6 


10 


50 


56 


B 


2 


2 


100 


120 


C 


4 


6 


60 


60 


D 


10 


12 


30 


24 


E 


8 


12 


40 


36 



Calculate Fisher's Ideal Index from the following data and show 
how it satisfies time reversal test and factor reversal test. 



Commodity 


Base year 
(1997) 


Current year 
(1998) 


Price 


Quantity 


Price 


Quantity 


A 


10 


10 


12 


8 


B 


8 


12 


8 


13 


C 


12 


12 


15 


8 


D 


20 


15 


25 


10 


E 


5 


8 


8 


8 


F 


2 


10 


4 


6 
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7) 



Construct cost of living index for 2000 taking 1999 as the base 
year from the following data using Aggregate Expenditure 
method. 



Commodity 


Quantity (kg.) 
1999 


Price 


1999 


2000 


A 


6 


5.75 


6.00 


B 


1 


5.00 


8.00 


C 


6 


6.00 


9.00 


D 


4 


8.00 


10.00 


E 


2 


2.00 


1.80 


F 


1 


20.00 


15.00 



Calculate the cost of living Index Number using Family Budget 
method 



Commodity 


A 


B 


C 


D 


E 


F 


G 


H 


Quantity in 
Base year (unit) 


20 


50 


50 


20 


40 


50 


60 


40 


Price in 

Base year (Rs.) 


10 


30 


40 


200 


25 


100 


20 


150 


Price in 

current year (Rs) 


12 


35 


50 


300 


50 


150 


25 


180 



9) Calculate the cost of living index number using Family Budget 
method for the following data taking the base year as 1995 



Commodity 


Weight 


Price ( 


Der unit) 


1995 


1996 


A 


40 


16.00 


20.00 


B 


25 


40.00 


60.00 


C 


5 


0.50 


0.50 


D 


20 


5.12 


6.25 


E 


10 


2.00 


1.50 
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10) From the data given below, construct a cost of living index 
number by family budget method for 1986 with 1976 as the 
base year. 



Commodity 


P 


Q 


R 


S 


T 


U 


Quantity in 1976 
Base year 


50 


25 


10 


20 


30 


40 


Price per 

unit in 1976 (Rs.) 


10 


5 


8 


7 


9 


6 


Price per 

unit in 1986 (Rs.) 


6 


4 


3 


8 


10 


12 



10.5 STATISTICAL QUALITY CONTROL (SQC) 

Every product manufactured is required for a specific 
purpose. It means that if the product meets the specifications 
required for its rightful use, it is of good quality and if not, then 
the quality of the product is considered to be poor. 

It is a well known fact that all repetitive process no matter 
how carefully arranged are not exactly identical and contain some 
variability. Even in the manufacture of commodities by highly 
specialised machines it is not unusual to come across differences 
between various units of production. For example, in the 
manufacture of corks, bottles etc. eventhough highly efficient 
machines are used some difference may be noticed in various 
units. If the differences are not much, it can be ignored and the 
product can be passed off as within specifications. But if it is 
beyond certain limits, the article has to be rejected and the cause 
of such variation has to be investigated. 

10.5.1 Causes for variation 

The variation occurs due to two types of causes namely 
(i) Chance causes (ii) Assignable causes 
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(i) Chance causes 

If the variation occurs due to some inherent pattern of variation 
and no causes can be assigned to it, it is called chance or random 
variation. Chance Variation is tolerable, permissible inevitable and 
does not materially affect the quality of a product. 

(ii) Assignable causes 

The causes due to faulty process and procedure are known 
as assignable causes, the variation due to assignable causes is of 
non-random nature. Chance causes cannot detected. However 
assignable causes can be detected and corrected. 

10.5.2 Role and advantages of SQC 

The role of statistical quality control is to collect and analyse 
relevant data for the purpose of detecting whether the process is 
under control or not. 

The value of quality control lies in the fact that assignable 
causes in a process can be quickly detected. Infact the variations 
are often discovered before the product becomes defective. 

SQC is a well accepted and widespread process on the basis 
of which it is possible to understand the principles and techniques 
by which decisions are made based on variation. 

Statistical quality control is only diagonstic. It tells us 
whether the standard is being maintained or not. The remedial 
action rests with the technicians who employ techniques for the 
maintenance of uniform quality in a continuous flow of 
manufactured products. 

The purpose for which SQC are used are two fold namely, 
(a) Process control, (b) Product control. 

In process control an attempt is made to find out if a particular 
process is within control or not. Process control helps in studying 
the future performance. 
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10.5.3 Process and Product control 

The main objective in any production process is to control 
and maintain quality of the manufactured product so that it conforms 
to specified quality standards. In otherwords, we want to ensure 
that the proportion of defective items in the manufactured product 
is not too large. This is called process control and is achieved 
through the technique of control charts. 

On the otherhand, by product control we mean controlling 
the quality of the product by critical examination at strategic points 
and this is achieved through product control plans pioneered by 
Dodge and Romig. Product control aims at guaranteeing a certain 
quality level to the consumer regardless of what quality level is being 
maintained by the producer. In otherwords, it attempts to ensure 
that the product marketed by the sale department does not contain 
a large number of defective items. 

10.5.4 Control Charts 

The statistical tool applied in process control is the control 
chart. Control charts are the devices to describe the patterns of 
variation. The control charts were developed by the physicist, 
Walter A. Stewart of Bell Telephone Company in 1924. He 
suggested that the control chart may serve, first to define the goal 
or standard for the process that the management might strive to 
attain. Secondly, it may be used as an instrument to attain that goal 
and thirdly, it may serve as a means of judging whether the goal is 
being achieved. Thus, control chart is an instrument to be used in 
specification, production and inspection and is the core of statistical 
quality control. 

A control chart is essentially a graphic device for presenting 
data so as to directly reveal the frequency and extent of variations 
from established standards or goals. Control charts are simple to 
construct and easy to interpret and they tell the manager at a glance 
whether or not the process is in control, i.e. within the tolerance 
limits. 
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In general a control chart consists of three horizontal lines 

(i) A central line to indicate the desired standard or level 
of the process (CL) 

Upper control limit (UCL) and 

Lower control limit (LCL) 



(ii) 

(iii) 



3 



Outline of a control chart 



out of control 



T 



-UCL 



+3 a 



T 
I 



-CL 



3o 



-LCL 



out of control 



-4- 



-4- 



-4- 



-4- 



-4- 



-4- 



12345678 
Sample Numbers 

(Fig. 10.9) 

From time to time a sample is taken and the data are plotted 
on the graph. If all the sample points fall within the upper and lower 
control limits, it is asumed that the process is "in control" and only 
chance causes are present. When a sample point falls outside the 
control limits, it is assumed that variations are due to assignable 
causes. 

Types of Control Charts 

Broadly speaking, control charts can be divided under two 
heads. 

(i) Control charts of Variables 

(ii) Control charts of Attributes 

Control charts of variables concern with measurable data on 
quality characteristics which are usually continous in nature. Such 

type of data utilises X and R chart. 
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Control charts of attirbutes, namely c, np and p charts 
concern with the data on quality characteristics, which are not 
amenable to measurement or attributes (prodcut defective or non 
defective) 

In this chapter, we consider only the control charts of 

variables, namely X chart and R chart. 

R- Chart (Range chart) 

The R chart is used to show the variability or dispersion of 
the quality produced by a given process. R chart is the companion 
chart to the X chart and both are usually required for adequate 
analysis of the production process under study. The R chart is 
generally presented along with the X chart. The general procedure 
for constructing the R chart is similar to that for the X chart. The 
required values for constructing the R chart are : 

(i) The range of each sample, R. 

(ii) The Mean of the sample ranges, R 

(iii) The control limits are set at 

U.C.L = D 4 R 

L.C.L =D 3 R 
The values of D 4 and D 3 can be obtained from tables. 

X Chart 

The X chart is used to show the quality averages of the 
samples drawn from a given process. The following values must 
first be computed before an X chart is constructed. 

1) Obtain the mean of each sample X : i = 1, 2 ... n 

2) Obtain the mean of the sample means 
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i.e 



^_ X 1+ X 2 +... + X„ 



3) 



where n is the total number of observations 
The control limits are set at 
U.C.L. = X+ ^R 



ZR 



LCL = X-A 9 R, where R = 



where R are 



the sample ranges. 

The values of A, for different n can be obtained from the tables. 

Example 28 

The following data relate to the life (in hours) of 10 
samples of 6 electric bulbs each drawn at an interval of one 
hour from a production process. Draw the control chart for 

X and R and comment. 



Sample No. 


Life time (in hours) 


1 


620 


687 


666 


689 


738 


686 


2 


501 


585 


524 


585 


653 


668 


3 


673 


701 


686 


567 


619 


660 


4 


646 


626 


572 


628 


631 


743 


5 


494 


984 


659 


643 


660 


640 


6 


634 


755 


625 


582 


683 


555 


7 


619 


710 


664 


693 


770 


534 


8 


630 


723 


614 


535 


550 


570 


9 


482 


791 


533 


612 


497 


499 


10 


706 


524 


626 


503 


661 


754 



(Given for n = 6, A, = 0.483, D 3 = 0, D 4 = 2.004) 
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Solution : 



Sample No. 


Total 


Sample 


Sample 






Mean X 


Range R 


1 


4086 


681 


118 


2 


3516 


586 


167 


3 


3906 


651 


134 


4 


3846 


641 


171 


5 


4080 


680 


490 


6 


3834 


639 


200 


7 


3990 


665 


236 


8 


3622 


604 


188 


9 


3414 


569 


309 


10 


3774 


629 


251 


Total 




6345 


2264 



Central line X = mean of the sample means = 634.5 
R = mean of the sample ranges = 226.4 

U.C.L. =X+A2R 

= 634.5 + 0.483 x 226.4 
= 634.5 + 109.35 = 743.85 

L.C.L. =X-A2R 

= 634.5 - 0.483 x 226.4 
= 634.5- 109.35 = 525.15 

Central line R = 226.4 

U.C.L. = D 4 R = 2.004 x 226.4 

= 453.7056 
L.C.L. = D 3 R =0x226.4 = 
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X Chart 
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R Chart 
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(Fig. 10.10) 
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Sample No. 
(Fig. 10.11) 
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Conclusion : 

Since one of the points of the sample range is outside the 
UCL of R chart, the process is not in control. 

Example 29 

The following data shows the value of sample mean X 
and the range R for ten samples of size 5 each. Calculate the 
values for central line and control limits for mean chart and 
range chart and determine whether the process is in control 



Sample no. 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Mean X 


11.2 


11.8 


10.8 


11.6 


11.0 


9.6 


10.4 


9.6 


10.6 


10.0 


Range (R) 


7 


4 


8 


5 


7 


4 


8 


4 


7 


9 



(Given for n = 5, ^ = 0.577, D 3 = D 4 = 2.115) 

Solution : 

Control limits for X chart 

X = - EX 
n 

= ^(11-2 + 11.8 + 10.8 + ...+ 10.0) = 10.66 

R = J-ZR=-f7r(63) = 6.3 
n 10 

U.C.L =X+A2R 

= 10.66 + (0.577x6.3) =14.295 

L.C.L. =X-A 2 R 

= 10.66 -(0.577x6.3) = 7.025 

CL = Central line = X = 10.66 

Range chart 

U.C.L. =D,R =2.115x6.3 = 13.324 



L.C.L. = D 3 R =0 



CL. 



R = 6.3 
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X Chart 
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(Fig. 10.12) 
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Conclusion : 

Since all the points of sample mean and range are within the 
control limits, the process is in control. 
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Samples 


1 2 3 


4 


5 


6 


X 


34 31.6 30.8 


33 


35 


33.: 


R 


4 4 2 


3 


5 


2 


Samples 


11 12 13 


14 


15 


16 


X 


35.8 38.4 34 


35 


38.8 


31.( 


R 


4 4 14 


4 


7 


5 



EXERCISE 10.5 

1) The following are the X and R values for 20 samples of 5 
readings. Draw X chart and R chart and write your 
conclusion. 

7 8 9 10 

35 33,2 33 32.6 33.8 37.8 
5 13 19 6 

17 18 19 20 

33 28.2 31.8 35.6 
5 3 9 6 
(Given for n = 5, A 2 = 0.58, D 3 = D 4 = 2.12) 

2) From the following, draw X and R chart and write your 
conclusion. 

Sample no. 123456789 10 

140 138 139 143 142 136 142 143 141 142 

143 143 133 141 142 144 147 137 142 137 

137 143 147 137 145 143 137 145 147 145 

134 145 148 138 135 136 142 137 140 140 

135 146 139 140 136 137 138 138 140 132 

Sample no. 11 12 13 14 15 16 17 18 19 20 

137 137 142 137 144 140 137 137 142 136 

147 146 142 145 142 132 137 142 142 142 

142 142 139 144 143 144 142 142 143 140 

137 142 141 137 135 145 143 145 140 139 

135 140 142 140 144 141 141 143 135 137 
(Given for n = 5, A 2 = 0.58, D 3 = 0, D 4 = 2.12) 

3) From the following data construct X and R chart and write 
your conclusion 
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Sample no. 



Sample no. 



1 


2 


3 


4 


5 


6 


7 


8 


9 


46 


41 


43 


37 


37 


37 


44 


35 


37 


40 


42 


40 


40 


40 


38 


39 


39 


44 


48 


49 


46 


47 


46 


49 


43 


48 


48 


10 


11 


12 


13 


14 


15 


16 


17 


18 


45 


48 


36 


40 


42 


38 


47 


42 


47 


43 


44 


42 


39 


40 


40 


44 


45 


42 


49 


48 


48 


48 


48 


48 


49 


37 


49 



(Given for n = 3, A 2 =1.02, D 3 = 0, D 4 = 2.58) 

EXERCISE 10.6 

Choose the correct answer 

1) A time series is a set of data recorded 

(a) periodically (b) at equal time intervals 

(c) at successive points of time (d) all the above 

2) A time series consists of 

(a) two components (b) three components 

(c) four components (d) none of these 

3) The component of a time series attached to long term 
variation is termed as 

(a) cyclic variations (b) secular trend 

(c) irregular variation (d) all the above 

4) The component of a time series which is attached to short 
term fluctuations is 

(a) seasonal variation (b) cyclic variation 

(c) irregular variation (d) all the above 

5) Cyclic variations in a time series are casued by 

(a) lock out in a factory (b) war in a country 

(c) floods in the states (d) none of these 
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6) The terms prosperity, recession depression and recovery are 
in particular attached to 

(a) Secular trend (b) seasonal fluctuation 

(c) cyclic movements (d) irregular variation 

7) An additive model of time series with the components T, S, 
C and I is 

(a)Y = T + S + C-I (b)Y = T + SXC + I 

(c)Y = T + S + C + I (d)Y = T + S + CXI 

8) A decline in the sales of ice cream during November to March 
is associated with 

(a) Seasonal variation (b) cyclical variation 

(c) random variation (d) secular trend 

9) Index number is a 

(a) measure of relative changes 

(b) a special type of an average 

(c) a percentage relative 

(d) all the above. 

10) Index numbers are expressed 

(a) in percentages (b) in ratios 

(c) in terms of absolute value (d) all the above 

11) Most commonly used index numbers are 

(a) Diffusion index number (b) price index number 

(c) value index number (d) none of these 

12) Most frequently used index number formulae are 

(a) weighted formulae (b) Unweighted formulae 

(c) fixed weighted formulae (d) none of these 

13) Laspeyre's index formula uses the weights of the 

(a) base year quantities (b) current year prices 

(c) average of the weights of number of years 

(d) none of these 

215 



14) The weights used in Paasche's formula belong to 

(a) the base period (b) the current period 

(c) to any arbitrary chosen period (d) none of these 

15) Variation in the items produced in a factory may be due to 
(a) chance causes (b) assignable causes 

(c) both (a) and (b) (d) neither (a) or (b) 

16) Chance variation in the manufactured product is 
(a) controlable (b) not controlable 
(c) both (a) and (b) (d) none of these 

17) The causes leading to vast variation in the specification of a 
product are usually due to 

(a) random process (b) assignable causes 

(c) non-traceable causes (d) all the above 

18) Variation due to assignable causes in the product occur due to 
(a) faulty process (b) carelessness of operators 
(c) poor quality of raw material (d) all the above 

19) Control charts in statistical quality consist of 

(a) three control lines 

(b) upper and lower control limits 

(c) the level of process 

(d) all the above 

20) The range of correlation co-efficient is 

(a) to °° (b) -°° 10 °° 

(c) -1 to 1 (d) none of these 

21) If X and Y are two variates, there can be atmost 

(a) one regression line (b) two regression lines 

(c) three regression lines (d) none of these 
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22) In a regression line of Y on X, the variable X is known as 
(a) independent variable (b) dependent variable 
(c) both (a) and (b) (d) none of these 

23) Scatter diagram of the variate values (X, Y) give the idea 
about 

(a) functional relationship (b) regression model 

(c) distribution of errors (d) none of these 

24) The lines of regression intersect at the point 
(a)(X,Y) (b)(X,Y) 

(c) (0, ) (d) none of these 

25) The term regression was introduced by 

(a) R.A.Fisher (b) Sir Francis Galton 

(c) Karl pearson (d) none of these. 
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